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QO (54) Title: XY1.0SYT.TRANSFERASE AND ISOFORMS THEREOF 

^ (57) Abstract: The invention relates to the isolation, purification and characterization of the enzyme xylosyl transferase (defined 
2 as "XT*). The invention describes for the first time that XT occurs in at least two isoforms ("XT-F, "XT-H'*). The invention re- 
lates furthermore to the recombinant cloning and expression of human and rat XT-I and XT-II and discloses their DNA and protein 
O sequences. The enzymes according to the invention can be used as therapeutic agents and as diagnostic markers, e.g. for the de- 
termination of enhanced proteoglycan biosynthesis, and as biochemical markers for determination of several pathological processes 
such as systemic sclerosis. 
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Xylosyltransferase and Isoforms thereof 



The invention relates to the isolation, purification and characterization of the enzyme 
xylosyltransferase (defined as "XT")- The invention describes for the first time that XT occurs 

5 in at least two isoforms ("XT-I", "XT-II"). The invention relates furthermore to the recombinant 
cloning and expression of human and rat XT-I and XT-II and discloses their DNA and protein 
sequences. The enzymes according to the invention can be used as therapeutic agents and as 
diagnostic markers, e,g, for the determination of enhanced proteoglycyn biosynthesis, and as 
biochenoical markers for determination of several pathological processes such as systemic 

10 sclerosis. 

Background of the Inyention 

Proteoglycans are polyanionic molecules widely expressed in animal cells and virtually every 
. tissue. These abundant molecules are present in the extracellular matrix and on the cell sur&ce 

15 and serve a wide range of fiuxctions. They are mcreasingly implicated as important regulators in 
many biological processes, such as extracellular matrix deposition; cell membrane signal 
transfer, morphogenesis, cell migration, normal and tumor cell growth and viral infection 
(Ruoslahti, 1989, j: BioL Chem 264, 13369-1372; Heroldera/., 1994, J. Gen. rirol 75, 1211- 
1222). Proteoglycans mediate diverse cellular processes Hirough interaction with a variety of 

20 protein ligands. In most of tiiese bindings electrostatic interactions with the glycosaminoglycan 
chains attached to the core protein are involved (Kjeilen & Lindahl, 1991, Anm. Rev, Biochem. 
60, 443-475), Thus, the biological activity of proteoglycans is intimately related to the 
glycosaminoglycan biosynthesis. 

25 The sulfated glycosaminoglycans chondroitin sulfate, heparan sulfate, heparin and dermatan 
sulfate are bound to the proteoglycan core protein by axylose-galactose-galactose binding 
region (Kjeilen & Lindahl, 1991, /.c). UDP-D-3cylose:proteoglycan core protein p-D- 
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various specips {Hoffincmn et cd., 1984, Connect Tissue. Res. 12, 151-164), and it was shown 
that the enzyme is secreted fix)m the endoplasmatic reticulum into the extracellular space 
together with chondroitm sulfete proteoglycans (KShnert et al*, 1991, Eur. J. Clin. Chem. Clin. 
Biochem. 29, 624-625; Gotting et al., 1999, X Invest. Dermatol. 112, 919-924). However, the 
5 processes resulting in the release of XT £ix»m the endoplasmatic reticulum or the Golgi 
compartments and the role of XT in the extracellular matnx are not yet known . 

As XT is the initial step enzyme in the biosynthesis of the glycosaminoglycan linkage region 
and as it is secreted into the extracellular matrix to a great extent, XT activity was proposed to 

10 be a diagnostic marker for the determination of an enhanced proteoglycan biosynthesis and of 
tissue destruction (Weilke et al., 1997, Clin, Chem, 43, 45-51). XT activities m the synovial 
fluid were found to be significantly increased in chronic inflammatory joint diseases (Kleesiek 
et al,, 1987, J. Clin, Chem, Clin, Biochem 25, 473-481). Recent studies have shown that serum 
XT activity is a biochemical marker for the determination of fibrotic activity in systemic 

15 sclerosis (Gdtting et al.,1999, l.c.; Gottmg et al., 2000, Acta Derm. Venereol. 80, 60-61.). 

Up to now there was no success to isolate or characterize xylosyltransferase (XT), however 
some methods were described to measure and determine the activity of said enzyme fix>m blood 
or body fluid samples of patients showing pathological effects such as scleroderma or chronic 

20 joint diseases (Stoohniller, 1972, J. Biol Chem, 247, 3525-3532). The samples were incubated 
with UDP-[^*C]xylose and an appropiate acceptor. The incorporated radioactivity indicated the 
amount of XT activity. Acceptors used so far are proteoglycans, silk fibrqin (Campbell et al., 
1984, Anal. Biochem. 137, 505-516) and several peptides (Bourdon et al., 1987, Proc. Natl 
Acad. Sci USA 84, 3194-3198). However, the all activity tests used herein did not allow precise 

25 determination of the lower XT activity in serum. A more specific acceptor protein is 

recombinant bikunin, the inhibitory component of human inter-a-trypsin inhibitor. Bikunin 
carries a single chondroitin, which is essential for the structure of the inhibitor. The chondroitin 
sulfate attachement site in the N-terminal region contains all elements responsible for 
recognition by XT. The complete recognition sequence is composed of the amino acids a-a-a-a- 

30 G-S-G-a-b-a, with a = E or D and b = G, E or D. This sequence was confirmed by 

determination of the Michaeliis-Menten (Km) constants for in vitro xylosylation of diflFerent 
synthetic proteins and peptides iising an enriched XT preparation firom conditioned cell culture 
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supematant of human chondrocytes. The constant was determined to be 22 ^M, which was 
decreased 9-fold in comparison to deglycosylated core protein from bovine cartilage (188 joM) 
(Brinkmannetai., 1997, J. Biol Chen 272, 11171-11175). With recombinant bikunin as 
acceptor, a sensitive assay was developed that allows precise determination of XT activity in 
5 human serum and other body fluids (see: Weilke et al., 1997, LcJ. Using this assay an increased 
xylosyltransferase activity was deteraiined in blood of patients with sclerotic processes of 
scleroderma, closely related to an elevated proteoglycan biosynthesis (Gotting et al., 1999j I.e./ 

The biosynthesis of glycosaminoglycans reqizires the coordinated action of a large number of 
10 glycosyltransferases. Isolation and cloning of these glycosyltransferases has been targeted for a 
long time, since the majority of these enzymes are only present in minute amounts. The 
structure and sequence of the glycosyltransferases involved in biosynthesis of the common 
glycosaminoglycan-protein linkage region has long remained unknown. Recent cDNA cloning 
of galactosyltransferase I (Okajima et al., 1999, J, Biol Chem. 274, 22915-22918) aad 
15 glucuruonyltransferase I (Ktagawa et al., 1998, J. Biol Chem. 273, 6615-6618) identified 2 of 
the at least 4 enzymes involved m synthesis of the GlcA|31-3Gaipi-3Galpl-4Xylpl-0-Ser 
structure. 

Isolation, purification and characterization of XT involved in biosynthesis of the common 
carbohydrate-protein linkage structure has been hampered by difficulties in obtainmg a 
sufficient amount of the source material. Since, as above mentioned, XT can be used as an 
additional biochemical maricer for the determination of sclerotic activity in systemic sclerosis 
and some mflammatory disorders, it is a real need for an isolated, highly pxirified XT which can 
be produced by recombinant methods, which is necessary for diagnostic and therapeutic 
purposes. Moreover, the knowledge of the cDNA sequence of XT allows to use it on gene level 
such as in gene diagnostic or gene therapy, 

Definitions 

• Above and below the term "XT" means xylosyltransferase (UDP-D-xylose:proteoglycan 
30 core protein prO-xylosyltransferase (EC 2.4.2.26) deriving from any origin and uicludes all 
possible isoforms, if not otherwise pomted out. The terai "hXT" has the meaning of human 
XT; tiie term "rXT" of rat XT. The temis "XT-I" , "XT-H", "hXT-I", "hXT-fi", "rXT-I2, 
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"rXT-n" mean the specific isoforms 1 and 2 of XT according to the mvention, wherein h 
and r have the indicated meanings. 

• Above and below tiie term "protein" meansa protein, a protein fragment or a peptide, if not 
otherwise explained. 

5 Above and below the term "recombinant protem" is defined as a protein which was 
produced by recombinant and biotechnological methods. 

• Above and below Ihe term "XT" or "XT protein(s)" has the meaning of (a) protein(s) 
deriving from any source, if not otherwise stated out, having the biological activity and / or 
ftmction ofJJDP-D-xyloseiproteoglycan core protein fi-D-xylosyltransferase, 

10 • Above and below the term "a protein having the biological activity of UDP-D- 

xyloserproteoglycan core protein 6-D-xylosyltransferase " is defined as a protein which has 
XT - identical or XT - like functions and activities and comprises XT itself and possible 
mutations, variants, isoforms thereof, including insertions, deletions and substitutions of one 
ore more amino acids. The term includes also fi:agments or longer forms of XT as well as 

15 dimeric or multimeric forms thereof showing XT functions. 

• Above and below Ihe term "isoform" means a second naturally occuixing enzyme having the 
same biological activity as a first naturally occurring enzyme, however diflfeamg by another 
amino acid sequence. 

20 Summary of the Invention 

It was found that XT can be purified 4,700-fold with 1% yield from serum-free JAR 
choriocarcinoma cell culture supernatant The isolation procedure includes a combination of 
ammonium sulfate precipitation, heparin affinity chromatography, ion exchange 
chromatography and protamine affinity chromatography. Amongst other proteins an unknown 

25 protein was detected by matrix-assisted laser desorption ionization mass spectrometry-time of 
flight analysis (MALDI-TOF) in the purified sample. The molecular weight of this isolated and 
purified XT protein was determined as 120.000 by SDS-polyacrylamide gel electrophoresis. 
The isolated protein was enzymatically cleaved by trypsin and endoproteinase Lys-C. Peptide 
fragments were sequenced by Edman degradation. Searches Avith the amino acid sequences in 

30 protein and EST databases showed no homology to known sequences. XT was enriched by 
immunoaffinity chromatography with an immobilized antibody against a synthetic peptide 
deduced from the sequenced peptide firagments and was specifically eluted with the antigen. In 
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addition, XT was purified alternatively with an aprotinin affinity chromatography and was 
detected by western blot analysis in the enzyme-containing firaction. 

Based on the'^partial amino acid sequence of their isolated and purified new enzyme (XT) 

5 derived from human JAR choriocarcinoma cell culture supernatant a novel cDNA was isolated 
according to the invention, encoding human XT-I enzyme using the degenerate reverse 
transcriptase-polymerase chain reaction method. The enzyme belongs to a novel family of 
glycosyltransferases having no homology to proteins of prior art 5'- and 3 '-RACE were . 
performed to isolate a novel cDNA fragment of 3726 bp with a single open reading frame 

10 encoding at least 827 amino acids with a molecular mass of 91 .000. The human XT-I gene was 
located on chromosome 16pl3.1 using radiation hybrid mapping, and extracts from CHO-Kl 
cells transfected with the XT-I cDNA in an expression vector exhibited marked XT activity. 
Furthermore, a new 3608-bp cDNA fragment encoding a novel protein of 865 amino acids was 
also isolated by PGR usmg degenerate primers based on the amino acid sequence of human XT- 

15 1. The amino acid sequence of this XT-II isoform displays 5S% identity to the human XT-L The 
XT-n gene is located on chromosome 17q21.3-I7q22^ and the exon/intron structure of the IS 
kb gene was determined. RT-PCR analyses of XT-I and XT-II mRNA from various tissues 
confirmed that both XT-I and XT-II transcripts are ubiquitously expressed in tiie human tissues, 
although with different levels of transcription. Furthermore, the cDNAs encodmg XT-I and XT- 

20 n &om rat were cloned. The deduced amino acid sequences of rat xyiosyltransferases displayed 
94% identity to the corresponding human enzyme. 

Therefore, it is an object of this mvention to provide the following subject-matters: 

• An isoform of UDP-D-xylose .-proteoglycan core protein fi-D-xylosyltransferaseQCT); 

25 • a protein comprising a sequence of said isoform or a fragment thereof, having the biological 
activity of XT; 

• a corresponding protein deriving from human or rat sources (hXT, rXT); 

• a corresponding protein isolated from specific human tissue, wherein said hXT has a 
molecular weight of 120.000 under SDS PAGE conditions; 

30 • a corresponding recombinant protein, wherein said protein is hXT-I comprising at least 827 
amino acids and having the amino acid sequence as depicted in Fig. 7B; 
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• a corresponding isofonn of hXT-I, termed as hXT-II, comprising 865 amino acids, 
exhibiting approximately 55% overall sequence identity to human hXT-I, and having, in 
more detail, the amino acid sequence as depicted in Fig. 8B. 

• a process for isolating and purifying a protein having the biological activity of human UDP- 
5 D-xylose:proteoglycan core protein 6-D-xylosyltransferase (EC 2.4.2.26), said process is 

characterized by the following steps: 

(i) culturing from human tissue showing an enhanced XT activity, preferably 
from JAR choriocarcinoma cells (ATCC HTB-144), and harvesting the cell 
culture supernatant, 

10 (ii) fractionated ammonium sulfate precipitation of the supematant of step (i), 

(iii) heparin afBnity chromatography of the precipitate of step (ii), 

(iv) ion exchange chromatography of the step (iii) product, and 

(v) afBmty chromatography ofthe step (iv) product, and optionally 
(vi a SDS-Polyacrylamide gel elektrophoresis of step (v); 

15 • recombinant forms of hXT and rXT, the corresponding DNA sequences (Fig. 7A, 8A, 9, 10) 
included as well as suitable e}q)ression vectors and e^qniession host cell systems; 

• antibodies directed against any of the above or below mentioned XT proteins and their uses 
in immunological assays and diagnostic tools for determining said XT proteins; 

• pharmaceutical compositions comprising a XT protein as defined above and below, 
20 optionally together with a suitable pharmacologically acceptable carrier, diluent or 

excipient; 

• uses of said XT proteins for the manufacture of a medicament for the treatment of XT 
relevant diseases and disorders, wherein the xylosyltransferase enzymes according to the 
invention can either be used directly as therapeutic drug in pathological situations where a 

25 deficioicy of said enzyme and its isoforais can be detected; 

• uses of said XT enzymes for the manufacture of a medicament for the treatment of diseases 
and disorders which are caused or accompanied by increased levels of said enzymes, e.g. in 
sclerotic diseases and chronic joint diseases, wherein said medicament is an inhibitor or 
antagonist of said XT proteins; 

30 • uses of said XT proteins as diagnostic markers for the di^osis of above- and below- 
mentioned diseases and pathological symptoms and uses of said DNA molecules as gene 
markers; 
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• methods of screening for compounds which are enable to inhibit the activity of said XT 
proteins according to the invention using known methods for detenmning ttie activity of 
XT. 

5 Detailed Description: 

General 

UDP-D-xylose: proteoglycan core protein fi-D-xylosyltransferase can be isolated and purified 
fi-om JAR choriocarcinoma cell culture. The isolated protein according to the invention is a 
single stranded polypeptide with a molecular weight of 120.000. The protein was enzymatically 

10 cleaved and eleven peptide fragments were sequenced by Edman degradation (see Examples). 
XT is present only in very small amounts in animal tissues but unlike other 
glycosyltransferases, more than 90% of XT is emiched in the medium of cultured cells. The 
highest secretion of XT activity was measured in JAR choriocarciaoma cell culture, in which 
sternal cartilage chondrocytes and 21 different human cell lines were examined To produce a 

15 highly enriched XT solution for the isolation of XT, JAR choriocarcinoma cells were adapted to 
hollow fiber culture conditions using a novel bioreactor (TECNOMOUSE) and Ultradoma-PF 
medium without serum addition as nutrient For purification of XT a combination of classic 
separation methods and new afGnity matrices was employed. Therefore, a heparin matrix was 
used as an affinity ligand for the XT. When £q)plied to immobilized heparin, XT was 

20 completely adsorbed at the matrix and the XT activity was eluted only with a high salt 

concentration after most contammating proteins have been removed fix)m the matrix. Protamine 
chloride is well-known as cationic activator for several sulfotransferases, so the effect of 
protamine chloride on the XT was investigated. An mcreased XT activity was measured when 
protamine chloride was added to the XT assay solution (see Examples), indicating an 

25 interaction of these arginine rich proteins with XT. Therefore, a protamine chloride affinity 
matrix was synthesized using an aldehyde activated perfusion mediinn as support. The 
interaction of XT with this affinity matrix resulted in a 13.6 times enrichment of XT. The 
protamine chloride affinity chromatography was the most efficient purification step during the 
isolation of the XT. Immobilized aprotinin, a Kunitz-type proteinase inhibitor, was found to be 

30 another appropriate affinity matrix for enrichment of XT, as it was able to adsorb XT 
quantitatively. Therefore, it was used for altemative purification of XT: Different lines of 
evidence showed that the isolated protein corresponds to the XT: (a) XT activity was enriched 
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using immobilized antibodies raised against a synthetic peptide deduced from the 120.000 
protein, and the XT activity could be competitively eluted with this peptide, (b) Immunoblot 
analysis of aprotinin affinity purified XT conesponds with the 120.000 protein. However, non- 
reducing and non-denaturating gel filtration chromatography with heparm affinity-purified XT 

5 fiom JAR cell culture supernatant shows an additional peak of XT activity at a molecular 
weight of approximately 500.000. XT is associated with proteoglycans. Treatment of XT with 
N-glycosidase F resulted in a decrease of the molecular weight from 120.000 to 1 16.000, 
suggesting that the XT is a glycoprotein. A comparison of the molecular mass of XT with other 
glycosyltransferases involved in biosynthesis of proteoglycans shows that the XT is larger than 

10 the other enzymes. Another difference is that nearly all proteoglycan glycosyltransferases are 
tightly bound to the membrane of the endoplasmic reticulum, whereas XT is secreted into the 
extracellular space. XT according to the invention contains like many glycosyltransferases a 
DxD motif, suggesting that this motif is involved in binding the metal-ion cofactor and the 
donator substrate (Gastinel et al., 1999, EhdBO J. 18, 3546-3557). A DxD sequence was also 

15 found in peptide 8 obtained from the enzymatically cleaved XT. 

The invention discloses for the first time the molecular cloning and expression of human and 
rat XT. Surprisingly, XT is found in at least two isomeric forms, which are termed according to 
the invention as XT-I and XT-II. Based on the amino acid sequence a novel cDNA yms cloned, 

20 which encodes a protein of at least 827 amino acids with a molecular mass of 91.000. A DxD 
motif was identified in the XT-I amino acid sequence using hydrophobic cluster analysis. This 
motif has been observed in many glycosyltransferases, and is involved in the coordination of a 
divalent cation in the binding of the nucieotide-sugar (Breton & Imberty, 1999, Curr. Opin, 
Struct. Biol 9, 563-571.). The translation initiation codon of the XT-I cDNA could not be yet 

25 cloned probably due to strong secondary structures in the 5'-region of the XT-I mRNA. The 
XT-I protein isolated from human JAR choriocarcinoma cell culture supernatant migrated on 
SDS-PAGE with a molecular mass of 120.000 and the protein size could be further reduced 
after N-glycanase digestion. These findings indicate that the cloned cDNA represents at least 
90% of the coding region of htmian XT-I. 

30 

Another cDNA was completely identified from human placenta RNA, which was similar but 
not identical to the hXT-I cDNA. The new cDNA encodes a protein of 865 amino acids wifli a 
molecular weight of 97.000. This novel protein termed XT-I has a proline-rich region located 
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near the amino-tenninus and the type n transmembrane topology characteristic of many other 
glycosyltransferases cloned to date. The hXT-II protein was similar to the human XT-I with an 
overall sequence identity of 55%. The similarity of the predicted amino acid sequences was low 
(<10%) at the amino-terminal end. However, laige stretches at the C-terminal region, where the 
5 catalytic domain was found to be located in glycosyltransferases, are very conserved with an 
identity of more than 80% in both proteins. These findings let conclude that the XT-II gene 
encodes another human xylosyltransferase, although the catalytic activity and the biological 
role of XT-II remain to be elucidated in detail. 

10 Since alterations in XT activity have been reported to be associated with fibrotic and sclerotic 
alterations of connective tissue (Gotting et al, 1999; Gotting et al, 2000, I.e.), the present 
findings provide molecular tools to study the ftmction and the regulated expression of human 
XT as well as the molecular mechanisms of these diseases. 

15 Production of XT 

JAR choriocarcinoma cells vMzh had been adapted to growth in tiie serum-firee Ultradoma-PF 
medium secreted XT activity into the cell culture siq>eniatant. During the e3q)onential grow& 
XT activity and total protein concentration in the supernatant of a traditional cell culture system 
(T-flasks) were determined as 0.2 mUA and 0.1 g/1, respectively. The cells were cultivated 

20 using three "Tecnomouse'*® bioreactor systems. Each bioreactor contained five culture 

casettes. About 10^ cells per culture casette were inoculated and medium probes of about 0.5 ml 
were taken every day to determine the viability of the cells and the glucose, and lactate 
concentration as well as the XT activity of the cell culture supernatant The cells amounted in 
the probes varies from 10^ to 10^ cells per ml with viability between 40 and 70%. XT 

25 production increased within three weeks after cell inoculation, reaching a plateau of 

approximately 30 mU/I. After three months the culture cassette was removed. Harvesting was 
carried out every two days, collecting 10 ml cell culture supernatant per culture cassette. The 
cell-free supernatant was collected and stored at -75°C. Mean XT activity La tiie supernatant of 
the high density culture was determined as 29.0 mU/1, while the total protein concentration was 

30 estimated as 4.8 g/1. In total 1 8.5 1 high density cell culture supernatant was collected and 
yielded 535.8 mU XT. 
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halation and purification ofXTfi'om cell culture supernatant 

XT was purified fonn 1 8.5 1 supernatant (equivalent to 2.000 1 normal cell culture supernatant) 
of serum-firee cultivated JAR choriocarcinoma cells to an apparent homogeneity of about 4700- 
fold purification. A summary of tiie purification steps for isolation of XT is shown in Table 1. 
5 XT activity of the crude supernatant was approximately 0.006 mU/mg protein. The purification 
method according to the invention comprises four, preferably five different steps. It is possible, 
that one step can be achieved more than once if necessary. 

Step 1: Fractionated ammonium sulfate precipitation XT of the ammonium sulfate precipitable 
fraction was dissolved in 0.46 1 buffer A with solubilization of 79.5% of the original activity. 

10 Step 2: Heparin affinity chromatography on POROS 20 HE 4 ml of XT-enriched solution from 
step 1 was loaded onto the POROS 20 HE column. XT activity was completely retained on the 
column. More than 70% of total protein passed through the column. Contaminating protein was 
eluted at a low NaCl concentration. 44% of the XT activity bound to the heparin matrix 
emerged at 0.5 M NaCl (Fig. 1 A). 

15 Step 3: Ion exchange chromatography on POROS 20 HQ 4 ml of the desalted XT-containing 
fraction from step 2 was loaded onto the POROS 20 HQ column equilibrated in buffer A. More 
than 98% of the XT activity bound to the resin. The column was ihsri eluted stepwise with NaCl 
in buffer A (Fig. IB). XT-containing firactions were collected. 

Step 4: AfBnity chromatography on protamine chloride The product of step 3 was desalted and 
20 concentrated using dia- and ultrafiltration. 1 00 ^1 of the protein solution was applied to the 
POROS protamme chloride column previously equilibrated with buffer A. Approximately 95% 
of the transferase activity bound to the column, whereas 75% of the contammating protein did 
not Additional proteins were eluted with buffer A containing low NaCl concentrations. 
Enzyme activity was eluted at approximately 0.15 M NaCl (Fig. IC). The enzyme activity 
25 was stable for at least 6 monlhs at -7S°C. 

Step 5: SDS-Polyacrylamide gel electrophoresis (SDS-PAGE) XT-containing flections firom 
step 1-4 were subjected to SDS-PAGE on a 4-12% gradient polyacrylamide gel (Fig. 2, panel 
A). Coomassie-stained protein bands were excised and characterized by MALDI-TOF mass 
spectrometry after tryptic digestion. The molecular weight of an unknown protein was 
30 determined as 120.000 (Fig. 2, panel B). 
Step 5 is an optionally step. 
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.... - ^ Table 1: 

Summary of single purification steps employed for isolation of XT firom 18.5 I of high density JAR 
cell culture supernatant 



Step 


Volume 


Total activity Total protein 


Spec, activity Purification 


. Recovery 




ml. 


10-3xunits 


mg 


10-3xunits^g 


-fold 


% 


JAR high density cell culture 












Supernatant 


18,500 


535.8 


89355.0 


0.006 


1 


100 


Anunonium sul&te 














precipitation 


460 


426.0 


8,937.8 


0.048 


8 


79 


Heparin affinity 














chromatography 


50 


108.3 


473.0 


0.229 


40 


35 


Ion exchange 














Chromatography 


5 


91.0 


43.1 


2.090 


348 


17 


Protamine afOhity 














chromatography 


1 


6.83 


0.24 


28.458 


4,743 


1 



5 Amino acid sequence analysis of XT. The MW 120.000 protein from the excised band was 
digested with trypsin and endoproteinase Lys-C. The pioteolytic fragments were separated by 
reversed-phase HPLC, and selected peptides were subjected to automatic amino acid 
sequence analysis; Table 11 shows the obtained 1 1 amino acid sequences determined by Edman 
degradation and mass spectrometry. 

10 Table II . X represent a not identified residue. The masses of three peptides were observed in the 
MALDI mass spectrum of the enzymatically digested MW 120.000 protein. Calculated mass values 
were obtained from the sequence obtained. 



Peptide 


observed mass 
(M + H^ 


calculated mass 


(l)ELGAK 






{2)ELLK 






(3)DMNFLK 






(4)IASPPSDFGR 


1045.5 


1045.5 


(5)LLLD 






(6)DFENVDNSNFAPR 


1524.7 


1525.7 


(7)PTFFAR 






(8)LQFSEVGTDXDA 






(9)ELGAVKPDGRL 


1152.6 


1153.7 


{10)ELLKRKLEQQEK 






(ll)LGLLMPEK 
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Immunochemical detection of XT 

Polyclonal antibodies against the synthetic peptide CSRQKELLKRKLEQQEK deduced from 
the peptides 2 and 10 of the enzymadcally cleaved MW 120.000 protein were covalently bound 
on POROS 20 PA. About 50% of the XT activity of an applied sanq>le was bound (Fig. 3, panel 

5 A) when a partially purified XT sample obtained by heparin affinity chromatography 

(purification step 2) was loaded onto the colunm. 58% of the adsorbed XT activity was eluted 
with 1 50 mM NaCl, and the rest was eluted with 12 mM HCl. Furthermore, the adsorbed XT 
activity was also eluted fi-om the solid phase when 100 ^1.(1 mg/ml) of the synthetic peptide 
was added to the mobile phase (Fig. 3, panel C). When immobilized preimmune serum was 

10 used as affinity matrix (negative control) no XT activity was adsorbed to or eluted firom 
the matrix (Fig. 3, panel B). The desalted XT fi^ction after heparin affinity chromatography 
(purification step 2) was loaded on an aprotinin affinity colxunn. The elution profile shows four 
major protein peaks (Fig. 4, panel I). 61% of the XT activity adsorbed to the aprotinin matrix 
emerged at 0.30 M NaCl and another 21% at 0.54 M NaCl. A single MW 120.000 band of the 

15 XT-containing firactions was detected by westan blot analysis with the polyclonal antibodies 
(Fig. 4, panel B), 

Determination of the molectdar weight of XT 

100 (xl of heparin afOnity purified XT was separated under non-reducing and non- 
20 denaturating conditions using a TSK G3000 SW column. Two XT activity maxima were 
detected at MW 500.000 and 120.000 (Fig. 5, panel A). The molecular mass of the MW 
120.000 protein was reduced about 3 % after N-glycosidase F digestion as shown by SDS- 
PAGE(Fig. 5, panel B). 

25 PCR-based cloning of human XT-I cDNA 

Based upon the amino acid sequence of 4 peptides degenerate primers were designed for 
cloning the XT-I cDNA (Figure 6). When primers SPPSl and Lysc-inv were used in a PGR 
with the first strand cDNA of SW1353 chondrosarcoma poly(A)'^ mRNA as template, a major 
band of approximately 690 bp was observed. After subcloning and sequencing a previously 

30 unknown DNA sequence was obtained from clone pCGl 14-29. PGR amplification with the 
primers DFl and a mixture of Inv2b and Inv2c resulted in a 1724 bp fragment, which was 
subcloned and sequenced. The deduced amino acid sequence of the clone pCGl 11-4 was 
identical to 6 of the sequenced peptides from human XT-I. The cloning strategy of rapid 
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amplification of cDNA ends (RACE) {Chenchik et al, 1996, BioTeckmques 21, 526-534) was 
" employed to clone the complete coding sequence of the XT-I cDNA. The largest DNA 
fragment obtained from the 3'»RACE reaction was 1.6 kb and consisted of a 3'-uhtranslated 

- -region of 1240 bp. The 5' RACE reaction was performed with different primers derived from 

5 the nucleotide sequence of XT-I cDNA, but the 5'-untranslated region of human XT-l cDNA 
could not be cloned using this method. All DNA fragments obtained contained just the known 
cDNA sequence and an additional 80 bp of coding sequence. Thus, a PCR-based screening 
approach using cDNA libraries as template was employed for cloning of the 5'-untranslated 
region. All DNA fragments obtained from fiie screening of 3 different human cDNA libraries 

10 stopped at the same nucleotide, indicating that stable secondary structures of the XT-I mRNA 
prevent the synthesis of cDNA of the 5' untranslated region during the reverse tratiscription 
reaction. However, the translation initiation codon of human XT-I could not be yet cloned. The 
combined cDNA of human XT-I contained 3726 bp (Fig. 7A) with a single open reading frame 
encoding at least 827 amino acids with a molecular mass of a least 91.000. The deduced amino 

IS acid sequence contained 3 potential N-glycosylation sites (Figure 7B). Analysis of the amino 
acid sequence using the hydrophobic cluster analysis (Gaboriaud et al, 1987» ) revealed tiie 
presence of a common DxD motif at position 1 82, which has been shown to be essential for 
binding nucleotide-sugars in glycosyltransferases . 

20 PCR-based cloning of human XT-II isoform cDNA , , . 

The degenerate primers PFF-sense and Lysc-invl which were designed upon the amino acid 
sequence of proteolytic cleaved peptides of human XT-I were used in a PCR reaction with first 
strand cDNA of placenta poly(A)* mRNA as template. The PCR amplification resulted in a 
minor band of 1 . 1 kb, which was cloned and sequenced. The determined nucleotide sequence 

25 was similar but not identical to the XT-I cDNA sequence, indicating that the fragment encodes 
a XT-n isofomi. To clone the complete coding sequence of the novel cDNA, the RACE 
strategy was employed. The cDNA finally obtamed was 3608 bp (Figure 8A) and contained a 
single open reading frame encoding a protein of 865 amino acids with a molecular mass of 
97.000. The 3'-imtranslated region is 850 bp and a 5 '-untranslated region of 149 bp was 

30 identified with an in-frame stop codon upstream of the ATG codon. A Kyte-Doolittie 

hydropathy analysis (Kyte & Doolittie, 1982, J. Mol Biol 157, 105-132) of tiie deduced amino 
acid sequence revealed one potential membrane-spanning region consisting of 16 hydrophobic 
amino acid residues at position 16 to 32, which appears to result in a type 11 transmembrane 
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orientation characteristic of many of the glycosyltransferases. The predicted amino acid 
sequence contains a proline-rich profile pattern from position 1 10 to 1 1 8 and 3 potential N- 
glycosylation sites (Figure 8B). The human XT-II isofonn protein exhibits 55% overall 
sequence identity to the human XT-I including stretches in the XT-II protein witii more than 
5 80% homology to the human 
XT-I. 

Cloning of rat XT-I and XT-II cDNA 

For amplification of the XT-I and XT-II cDNA from rat tissue gene-specific primers were 
10 designed based upon the nucleotide sequence of human XT-I and XT-II cDNA. Theses 

oligonucleotides were employed in PGR, 5' RACE and 3 ' RACE reactions -with rat liver cDNA 
as template and a 2593 bp cDNA fragment (Fig, 9) coding for rat XT-I was obtained, 5' RACE 
reactions using a diversity of gene-specific primers were performed for identification of the 5' 
end of the XT-I cDNA. All cDNA fragments obtained stopped at exactly the same nucleotide 
15 position as observed in the human XT-I cDNA, indicating that stable secondary structures of 
the XT-I mRNA inhibit cDNA synthesis of the 5' region. The XT-II cDNA fragment finally 
obtained after PGR amplification with rat liver cDNA was 2782 bp (Fig. 1 0) and included the 
entire coding region of rat XT-II. The deduced ammo acid sequences of XT-I and XT-II (SEQ 
IDs 6, 8) from rat each displayed 94% identity to the corresponding human xylosyltransferase. 
20 AUgnment of all amino acid sequences revealed the presence of highly conserved amino acid 
clusters in the central and carboxyteiminal regions of the proteins. 

Identification of the chromosomal localization of the human XT-I and XT-II genes 
The gene coding for human XT-I could be assigned to the short arm of chromosome 16 at the 
25 region 1 6p 1 3 . 1 . The radiation hybrid mapping using the Genebridge 4 radiation hybrid 
screening panel located the position of the PGR fragment 5.98 cR distal to the STS marker 
GHLC.GATA42E1 1 and 4.40 cR proximal to marker D16S499. The XT-II gene was identified 
on chromosome 17 at the position 17q21 .3-17q22 and was found to be located distal to the STS 
marker WI-1 1424 and proximal to the marker WI-143 15. 

30 

Expression of recombinant XT 

To prove the function of the cDNA product, a recombinant form of XT-I was generated by 
ftising the cloned XT-I to an aminoterminal peptide tag. The fiised protein was e3q)ressed in 
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CHO-Kl cells and absorbed from the medium by immunoprecipitatioii with mii-Xpress- 
antibodies and protein G agarose beads to eliminate endogenous XT activity. The enzyme- 
bound beads were used as an enzyme source and assayed for XT activity as shown in Table 3 
(see below). No detectable XT activity was recovered by the aflHnity purification from a control 

5 transfection sample. The substrate specificity of the recombinant XT-I was sunilar to that of the 
XT-I isolated from human body fluids and cell culture supematants. The recovered enzyme 
activity of the recombinantly expressed XT-I could be completely inhibited by addition of 250 
U of heparin. As specific inhibition of human XT activity by heparin has been demonstrated 
previously (Kleesiek et al, 1987, I.e.), these results clearly indicate that the expressed protein is 

10 the human XT. However, no enzymatic transfer of xylose to the acceptor peptides used in this 
study was observed when expressing XT-II fused to the aminoterminal peptide tag in CHO-Kl 
cells. 

To identify the XT-I reaction products, the bikunin peptide was labeled with [^"^q-D-xylose 
15 using the XT-I-bound beads as an enzyme source. The products were subsequently subjected to 
the linkage-specific digestion of the bound [^"^Q-D-xylose with a- and P-xylosidase and 
alkaline p-eliminadon. Incubation of the reaction products with P-xylosidase resulted in the 
release of 74% of the incorporated [^^C]xyIose, y^ereas only less than 4% of the peptide-bound 
xylose was digested after treatment with a-xylosidase. The alkaline cleavage of the O- 
20 glycosidic linkage between the xylose and the p-hydroxyamino acid serine in the presence of 
borohydride lead to the liberation of more than 97% of the enzymatically transferred 
[^"^CJxylose. In all the experiments performed no significant amount of [^'^Cl-D-xylose was 
incorporated without the addition of the bikunin peptide as acceptor. These results clearly 
indicate that a xylose residue was transferred to the hydroxyamino acid serine of the bikunin 
25 peptide through a P-linkage. In conclusion the expressed protein was identified as UDP-D- 
xylose:proteoglycan core protein p-D-xylosyltransferase (EC 2.4.2.26). 
Table 3: 

Xylosyltransferase activity of recombinant XT-I expressed in CHO-Kl cells, XT activity of the 
enzyme fractions using different acceptors and the inhibition of enzyme activity by addition of 
30 heparin is shown. The synthetic bikunin, L-APP and L-APLP2 homologous peptides have 

been previously proved to be good acceptors for XT mediated jqriosylation (Brinkmann et aL, 
1997; Getting et al, 1998). No XT activity was detected in samples from mock-transfected 
cells after affinity purification, n.d., not detected (detection limit, 20 nU/l). 
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Acceptor 



XTActrvitjy 



pCG227-Xr 



+ heparin 



Recombinant bikunin 

Bikunin peptide QEEGSGGGGQK 

L-APLP2 peptide SENEGSGMAEQK 

L-APP peptide TENEGSGLTNIK 

Peptide SGG 

Ciiondroitin sulfate A 

Chondroitin sulfate C 



2854 
463 
515 
492 
n.d. 
n.d. 
n.d 



n.d. 
n.d. 
n.d. 
n.d. 
ad. 
n.d. 
n,d. 



Tissue-specific expression of XT-I and the XT-U isoform 

The expression of XT-I and XT-II isofonn gene was examined in various human tissues using a 
RT-PCR based method with normalized first-strand cDNA. Each PGR yielded a single product 

5 witii predicted nucleotide lengths of 490 bp for XT-I and 717 bp for XT-II, although the amount 
of the amplified product varied (Figure 1 1). Amplification of XT-I and XT-II firagments 
resulted in a product visible by ethidium bromide staining after 36 cycles, whereas the DNA 
fi:agment corresponding to the abundant glyceraldehyde-3-phosphate dehydrogenase transcript 
wais visible after 20 cycles. The greatest abundance of XT-I expression was detected in the 

10 placenta, kidney and pancreas and only a very weak expression was detected in skeletal muscle. 
The greatest abimdance of XT-II isoform is expressed in the kidney and pancreas. 

Cellular distribution of XT activity in cultured CHO-Kl cells 

After an incubation period of 3 days cultured CHO-Kl cells were harvested and the XT activity 
15 was determined in the spent culture supernatant and the cell lysates. 92% of the total XT 
activity was found to be located in the cell culture medium (93.1 mU/10^ cells, SD 9.58), 
whereas only 2% was detected in the cell lysates (2.03 mU/10^ cells, SD 0.44). 6% of the total 
XT activity (5.82 mU/10^ cells, SD 2.18) was released from the membrane-containing fractions 
after addition of the detergence Triton X-100 indicating that less than 1/lOth of XT is bound to 
20 cellular membranes. 

Pharmaceutical and diagnostic use 

The xylosyltransferase enzymes according to the invention can be used directly as therapeutic 
drug in pathological situations where a deficiency of said enzyme and its isoforms can be 
25 detected. As pointed out above XT enzyme may be overexpressed m some diseases and 
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disorders. In these cases inhibitors and / or antagonists of XT-I and or XT-II can be used. Thus, 
the invention relates also to the use for the manu&cture of a medicament for the treatment of 
diseases which are caused by increased levels of said enzymes, wherein said medicament is an 
~ inhibitor or antagonist of xylosyltransferase. 

5 As mentioned above the protein according to this invention can be used as diagnostic means to 
evaluate pathological conditions. As used herein, the term "pharmaceutically acceptable carrier" 
means an inert, non toxic solid or liquid jfilier, diluent or encapsulating material, not reacting 
adversely with the active compound or with the patient. Suitable, preferably liquid carriers, are 
well known in the art. The formulations according to the invention may be administered as unit 

10 doses containing conventional non-toxic pharmaceutically acceptable carriers, diluents, 
adjuvants and vehicles which are typical for parenteral administration. 
The tenn "parenteral" includes herein subcutaneous, intravenous, mtra-articular and 
intratracheal injection and infusion techniques. Also other administrations such as oral 
administration and topical application are suitable. Parenteral compositions and combinations 

IS are most preferably adminstered intravenously either in a bolus form or as a constant fusion 
according to known procedures. Tablets and capsules for oral administration contain 
conventional excipients such as binding agents, fillers, diluents, tableting agents, lubricants, 
disintegrants, and wetting agents. The tablets may be coated according to methods weU known 
in the art 

20 Unit doses according to the invention may contain daily required amounts of the protein 

according to the invention, or sub-multiples thereof to make up the desired dose. The optimum 
therapeutically acceptable dosage and dose rate for a given patient (mammals, including 
humans) depends on a variety of factors, such as the activity of the specific active material 
employed, the age, body weight, general health, sex, diet, time and route of administration, rate 

25 of clearance, enzyme activity (units/mg protein), the object of the treatment, i. e., therapy or 
prophylaxis and the nature of the disease to be treated. 

Therefore, in compositions and combinations in a treated patient (in vivo) a pharmaceutical 
efiFective daily dose of the proteia of this invention (hXT, hXT-I, hXT-II) is between about 0.01 
and 100 mg/kg body weight (based on a specific activity of 100 kU/mg), preferably between 
30 0.1 and 10 mg/kg body weight. Accordiag to the application form one single dose may contain 
between 0.5 and 10 mg of hXT. 
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Short Description of the Figures 
Figure 1; Purification of XT 

Panel A. Heparin affinity chromatography on POROS 20 HE: The dissolved ammonium sulfete 

5 precipitate from JAR choriocarcinoma supernatant was applied to a POROS 20 HE column. 

After equilibration witii buffer A, the column was eluted stepwise with NaCl ( ). Protein 

elution was monitored at wavelength A280 ( ), and fractions of 38 ml were assayed for XT 

activity (■ ). The horizontal bracket indicates fractions collected for frirther purification. 
Panel B. Ion exchange chromato^aphy onsPOROS 20 HQ : The desalted and concentrated 

10 product from heparin a£5nity chromatography was loaded onto a POROS 20 HQ column. The 

colunm was washed with buffer A. Adsorbed proteins were eluted stepwise with NaCl ( ). 

Protein elution was monitored at A280 ( ), and fractions of 50 ml were assayed for XT 

activity (■). The horizontal brackets indicate the fractions collected for frirther purification. 
Panel C Protamine chloride affinity chromatography: The desalted and concentrated product 

15 from ion exchange chromatography was applied to a protamine chloride POROS column. After 
a washing step with buffer A, the column was eluted stepwise with NaCI 

( ). Elution was monitored at A280 ( ), and fractions of 6 ml were assayed for XT 

activity ( ■ ). The horizontal bracket indicates the fractions collected. 
Figure 2 SDS-PAGE ofXTfi-actions at various purification steps 

20 Panel A: XT fractions at varioxis purification steps were subjected to 4-12% 

gradient polyacrylamide gel for SDS-PAGE. Lane I JAR cell culture supernatant 

(crude material); lane II, ultrafiltration retentate; lane III dissolved ammonium 

sulfete precipitate; lane IV, protein eluted with NaCl from the POROS 20 HE column; lane 

K protein eluted with NaCl fix)m the POROS 20 HQ column; lane VI protein eluted wifli NaCl 

25 from the protamine chloride POROS column. Lane M, molecular size standard: myosin 

(200.000), phosphorylase b (97.000), bovine serum albumin (66.000), glutamic dehydrogenase 
(55.000), carbonic anhydrase (31.000), trypsin mhibitor (22.000). The gel was silver-stamed. 
Panel B: Collected XT fractions from protamine aflSnity chromatography were subjected to 
SDS-PAGE on a 4-12% gradient polyacrylamide gel. The arrows indicate the stained bands 

30 corresponding to the unknown protein, which was shown to be XT (120,000) (1), hexose-6- 
phosphate dehydrogenase (89.000) (2), ezrin (68.600) (3), quiescin Q6 (64.000) (4), 
plasminogen activator (47.000) (5), aldolase A (39.000) (6) and low molecular artefacts (7). All 
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protein bands were excised, enzymadcaliy digested, and the peptide mixture was characterized 
by MALDI-TOF mass spectrometry. The gel was stained with Coomassie Brilliant Blue. 
Figure 3: Immunoqffinity chromatography of xylosyltransferase 

Panel A. 100 jil of the desalted XT-containing fractions eluted from heparin affinity matrix was 
5 applied to a column of immobilized polyclonal antibodies. After washing with buffer D, the 
column was eluted as indicated by the arrow with buffer D / 0.15 M NaCl (1) followed by 12 
mM HCI (2). Fractions of 1 ml were collected into tubes containing 1 ml of 0. 1 M Tris/HCl, pH 

8.0. Protein elution was monitored at 280 nm ( ) and the XT activities (■) of each fraction 

were assayed. 

10 Panel B, Negative control of the immimoaffinity chromatography with immobilized preimmune 
serum. For conditions see Panel A. 

Panel C Immunoaffinity chromatography with immobilized polyclonal antibodies. The column 
was eluted with buffer D containing 1 00 |xl (1 mg/ml) of the peptide antigen indicated by the 
arrow. Fractions of 1 ml were collected into tubes containing 1 ml of 0.1 M Tris/HCl, pH 8.0. 

15 Protein elution was monitored at 280 nm ( ) and the XT activities (■) of each fraction 

were assayed. 

Figure 4; Aprotinin affinity chromatography of partially purified XT and 
immunoblot analysis of the separated fractions 

Panel A. 200 fil desalted XT fraction from the heparin purification step was applied to an 
20 aprotinin column previously equilibrated with buffer A: After washing with buffer A, the 

adsorbed proteins were eluted stqpwise with NaCl ( ). Protein elution was monitored at 

A280 ( ) and fractions of 2 ml were assayed for XT activity (■). 

Panel B, Aliquots of fraction 3, fraction 7, fraction 12 and fraction 16 were analyzed by western 
blot. The MW 120.000 protein was detected in the XT-containing fraction 7 and 12. 

25 Immunological detections were performed using a polyclonal rabbit antiserum raised against 
the synthetic peptide CSRQKELLKRKLEQQEK deduced from the peptides 2 and 10 of the 
enzymatically cleaved unknown protein. Prestained molecular size standard were myosin 
(190.000), BSA (64.000), glutamic dehydrogenase (51.000). 
Figure 5 : Gel filtration chromatography and N-glycosidase F digestion 

30 Panel A. 1 00 p.1 of the XT containing fraction from heparin aflSnity HPLC was applied to a 
TSK G3000 SW column (30 cm X 7.5 mm, 10 |im particle size) previously equilibrated with 
buffer A / 0. 1 5 M NaCl. Elution was performed with the same buffer. Fractions of 200 jil were 
collected and assayed for XT activity (■). Protein elution was monitored at A280 ( \ The 
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arrows indicate the elution positions of thyroglobulin (669.000) (1), ferritin (440.000) (2), 
aldolase (158:000) |3), albumin (67.000) ^4), ovalbumin (43.000) (5% chymotrypsinogen A 
(25.000) (6), and ribonuclease A (13.700) (T), 

Panel A An aliquot of 1 |xg of the MW 120.000 protehi was analyzed by SDS-PAGE 

5 (I). Aliquots (1 iiig) of tiie MW 120.000 protein were digested with 3,1 x 10 units of N- 
glycosidase F at 37**C for 1 h (II) and 12 h (III). The samples were then subjected to SDS- 
PAGE, and protein bands were detected by silver staining. The molecular size standard were 
myosin (200.000), b-galactosidase (1 16.000), phosphorylase b (97.000). 
Figure 6; Cloning strategy for human XT-I and XT-II isoform 

10 (A) The peptides 4, 6, 7, and 9 are sequences that were most favorable for the design of 
degenerate PGR primers. The amino acid sequence of the peptides was obtained after 
proteolytic digestion of the puriJBed human XT. The strategy for cloning of XT-I cDNA (B) and 
XT-n isoform (C) is illustrated. The open reading frame of XT-I and XT-II is shown as a filled 
box, and the location of the peptides 4, 6, 7 and 9 is illustrated by open boxes. The location and 

15 orientation of the degenerate primers employed for cloning of XT-I and XT-II are marked by 
arrows. XT-I and XT-II cDNA inserts contained within the indicated plasmids were obtained 
using RT-PCR with degenerate primers (pCGl 1 M, pCGl 14-29* pCGl 10-7), 5' RACE 
(pCGl 85-21, pCG212-19, pCG319.23, 3' RACE (pCG204-38, pCG21 M) and RT-PCR with 
gene-specific primers (pCG176-l).H = A + C + T,Y = C + T,R = A + G,N = A+G + C + T, I 

20 = deoxyinosine. 

Figure 7A. B: Nucleotide (A) and deduced amino acid (B) sequence of the hXT-L 
The position of the peptide sequences obtained by digestion of the purified human XT are 
underlined (B). Potential N-glycosylation sites are double underlined. The DxD motif is 
intensified depicted. The figures are identical with SEQ ID 1 and 2. 
25 Figure 8A,B; Nucleotide (A) and amino acid (B) sequence of the hXT-U isoform. 

Potential N-glycosylation sites are double underlined (B). The putative transmembrane domain 
is underlined. The figures are identical with SEQ ID 3 and 4. 

Figure 9: Nucleotide sequence of rXT-I (SEQ ID 5), The corresponding protein sequence is 
depicted in SEQ ID 6. 

30 Figure 10; Nucleotide sequence of rXT-U (SEQ ID 7). The corresponding protein sequence is 
depicted in SEQ ID 8. 

Figure 11 ; Differential expression of the XT-I, XT-II gene in human tissues. Semiquantitative 
RT-PCR with normalized first-strand cDNA was used to examine tiie abundance of XT-I and 
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XT*n transcripts. A 983 bp fragment of the housekeeping gene glyceraldehyde-3-phosphate 
dehydrogenase cDNA was amplified as control (A). The 490 bp XT-I cDNA fragment (B) and 
the 717 bp fragment amplified fit)m XT-II isoform cDNA (C) were detected in each tissue 
indicating that both enzymes are ubiquitously expressed. The arrows indicate the expected 
5 position for each PGR product 

The following examples describe the invention in more detail without to limit scope of the 
technical teaching. 

Example 1: Materials for Isolation and Purifications, Human JAR choriocarcinoma cells 

10 were purchased from ATCC (Rockville, MD). Dried UltraDOMA-PF medium was obtained 
from BioWhittaker (Vervier, Belgiimfi) and aqua ad injecta from Braxm (Melsungen, Germany). 
Heat-inactivated fetal calf serum, Dulbecco's phosphate-buffered saline, antibiotic/antimycotic 
solution, trypsin-EDTA solution, trypan blue, protamine chloride and the Bicinchoninic Acid 
Protein Assay Kit were purchased from Sigma (Deisenhofen, Germany). Cell culture 

15 flasks, serological pipettes, and sterile tubes were purchased from Beaton 

Dickinson (Heidelberg, Germany). The hybrid hollow-fiber bioreactor TECNOMOUSE® 
was si^^plied by Integra Biosciences O^emwald, Germany), the ACA analyzer by 
Dade Diagnostica (Mfinchen, Germany) and the Super G analyzer by RLT 
(MShnesee, Germany). UDP-[ C]xylose (9.88 kBq / nmol) came from DuPont (Bad 

20 Homburg, Germany), 25 mm diameter nitrocellulose discs from Sartorius (GOttingen, 
Germany), scintillation mixture and the liquid scintillation counter LSSOOTD was obtained 
from Beckman Coulter (Fullerton, CA). Ultrafiltration cells, YMl membranes and PVDF 
membranes (Immobilon P) were purchased from Millipore (Eschbom, Germany). 
The chromatography media POROS 20 HQ, POROS 20 HE2, POROS 20 AL, POROS 20 EP 

25 and the HPLC workstation Biocad Sprint were supplied by Perseptive 

Biosystems (Framingham, MA), The gel filtration column TSK G3000 SW (30 cm x 7.5 mm, 
I0\}m particle size) was obtained from TosoHaas (Montgomeryville, PA). The MALDI mass 
spectrometer Reflex n was from Bruker Daltonik GmbH (Bremen, Germany) and protein 
sequencer Procise 494 cLC was purchased from PE Biosystems (Framingham, MA). Precast 

30 polyacrylamide gels, buffers, and NuPAGE electrophoresis system XCell n Mini-Cell and Blot 
Module were from Novex (San Diego, CA). The synthetic peptide CSRQKELLKRKLEQQEK 
and the rabbit antiserum were purchased from BioScience (Gottingen, Germany). Peroxidase- 
conjugated affinipure F(ab')2 fragment goat anti-rabbit IgG (H+L) was purchased from ' 
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Dianova (Hamburg, Gennany). N-glycosidase F was obtained from Roche (Mamiheim, 
Germany). 

Example 2; Cell culture. JAR chbriocarcinbma cells releasing XT in the cell culture 
supernatant were cultured in Ultradoma-PF medium containing 10% heat-inactivated fetal calf 
. : 5 serum, 100 U/ml penicillin, 100 pg/ml streptomycin and 250 ng/ml amphotericin B in a 

humidified atmosphere of 5% C02 and 95% air at 37°C. After mcubation for 24 h in the serum- 
containing medium, the cell cultivation was adapted to serum-firee conditions as described 
previously (6). Scaling up of XT production was carried out in three hybrid hollow-fiber 
bioreactors TECNOMOUSE®. During the exponential growth the cells fi'om three 175 cm 2 T- 

10 flasks (> 3 X 10 ^ cells) were detached with 0.5% trypsin and 0.2% EDTA in Dulbecco's 

phosphate-bulBFered saline by incubation at 37°C for 10 min. After centrifixgation (5 min, 1000 x 
g) of the cell suspension the cell pellet was resuspended in 10 ml 37°C warm serum-free and 
protein-free Ultradoma-PF medium and washed three tiines with 20 ml of the same mediirai. 
The cell suspension was drawn into a 10 ml syringe and then inoculated into the extracapillary 

IS space (EC space) of the reactor. The bioreactor was connected with a 2 liter medium bottle and 
set to 1 50 ml/h in the recirculation mode, and the oxygenation pump was set as described in the 
operating manual. Five days after inoculation a 10 ml syringe was connected to the left hand 
EC port and 10 ml of cell culture supernatant was harvested from the EC space. The harvesting 
was continued every two days over a period of 3 monlhs. Glucose and lactate concentration of 

20 the cell culture supernatant were controlled using the Super G analyzer and the ACA analyzer, 
respectively. The 2 liter medium bottle was replaced every three days. The viability of tiie cells 
was determined by trypan blue exclusion. 

Example 3; Synthesis of the protamine affinity matrix. Protamine chloride was immobilized as 
ligand on POROS 20 AL. 30 mg ligand was dissolved m 10 ml of 10 mM phosphate / 0.15 M 

25 NaCl, pH 7.4. After the protein had been dissolved, 5 ml of 1 00 mM phosphate / 1 .50 M NaCl, 
pH 7.4 was added. NaCNBHa was dissolved in the ligand^uffe^ solution to a final 
concentration of 5 mg/ml and 1.0 g POROS 20 AL was suspended in the same solutioiL 
The suspension was mixed gently on a shaker for 1 minute at RT. An additional 2 ml of 100 
mM phosphate / 1 .50 M NaCl, pH 7.4 was added to the suspension and the mixture was shaken 

30 continuously. This step was repeated every five minutes until the mixture volume was 25 ml. 
After additional shaking for 2 h, the medium was filtered on a sintered glass fimnel. The matrix 
was suspended m 20 ml of 0.2 M Tris/HCl / 5 g/l NaCNBHa, pH 7.2 and mixed gently on a 
shaker for 30 min at RT. After the media had been washed in a sintered glass fimnel using 100 
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ml of 10 mM phosphate, pH 7.4, 100 mi of 1,0 MNaCl and another 100 ml of 10 mM 
phosphate, pH 7.4, the matrix was packed m a PEEK column (4.6 x 50 mm). 
Example 4; Synthesis of the aprotinin affinity matrix. Aprotinin affinity matrix was synthesized 
according to tiie synthesis of the protamine afGnity matrix, but using aprotinin as iigand. After 

5 immobilization of the ligand the matrix was packed in a PEEK column (4.6 x 50 mm). 
Example 5: Piirification of xylosyltransferase from JAR cell culture supernatant. 
Fractionated ammonium sulfate precipitation and chromatography steps were performed at RT, 
ultraJBltration and diafiltration were carried out at 4°C. 18-5 liters JAR cell culture supernatant 
collected from three hybrid hollow-fiber bibreactors, TECNOMOUSE®, each containing 5 

10 culture cassettes, was concentrated to 800 ml with ultrafiltration cells using YMl cellulose 
membranes. The retentale was centrifuged at 4,000 x g for 1 h. The supernatant was decanted, 
and the pellet was discarded. 

Step 1: Fractionated ammonium sulfate precipitation - Solid ammoniimi sulfate was added to 
the supernatant to 28% saturation. After 1 h at RT the suspension was centrifiiged at 4»000 x g 

15 for 2 h, tiie stq)ematant was decanted, and the precipitate was removed. Additional ammonium 
sulfate was added to the solution to the point of 40% saturation, and the suspension was 
allowed to stand for 1 h. To recover the precipitate the supernatant was decanted after the 
suspension was centrifuged at 4,000 x g for 2 h. Before chromatography on immobilized 
heparin the precipitate was dissolved in 460 ml buffer A (20 mM sodixmi acetate, pH 6.0). 

20 Step 2: Heparin cffinity chromatography on POROS 20 HE2 - The step 1 product was passed 
through a 02 Jim filter. 4.0 ml of the filtrate was applied to a POROS 20 HE2 column (16 x 100 
mm) equilibrated with buffer A at a flow rate of 40 ml/min. After washing the colunm with 100 
ml of buffer A tiie XT activity was eluted with the same buffer containing NaCl. The NaCl 
concentration was increased stepwise: 20 ml buffer A / 0.09 M NaCl; 20 ml buffer A / 0. 1 5 M 

25 NaCl; 30 ml buffer A / 0.24 M NaCl; 24 ml buffer A / 0.30 M NaCl; 24 ml buffer A / 0.60 M 
NaCl; 24 ml buffer A / 1 .00 M NaCl; and 24 ml buffer A / 1 .89 M NaCl. Fractions of 38 ml 
each were collected and the XT activity was measured. The procedure was repeated 115 times 
by cyclic chromatography and the firactions containing XT activity (1 15 x 38 ml) 
were collected. 

30 Step 3: Ion exchange chromatography on POROS 20 HQ - Collected firactions firom step 2 were 
desalted using diafiltration with YMl cellulose membranes and ultrafiltration cells. After 
concentration of the desalted protein solution to 0.05 liter using analogous techniques the XT- 
enriched solution was subjected to ion exchange chromatography. 4.0 ml of the XT solution 
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was applied onto the POROS 20 HQ column (16 x 100 mm) previously equilibrated with buffer 
A at a flow rate of 40 ml/min. The column was washed wifli 80 ncd buffer A, and the adsorbed 
protein was eluted stepwise usmg the same buffer containing 0.07 M NaCl (88 ml), 0. 1 8 M 
NaCl (120 ml), and 0.36 M NaCl (120 ml) followed by a linear gradient of 0.36 - LOO M NaCl 
5 . , (200 ml) and another step of buffer A / 2.0 M NaCl (120 ml). 50 ml fractions were collected 
and assayed for activity and evaluated by SDS-PAGE. Chromatography was repeated 13 times, 
and the fractions exhibiting XT activity (1 3 x 50 ml) were collected for affinity 
chromatography. 

Step 4: Affinity chromatography on protamine chloride - XT-containing solution from step 3 
. 10 was desalted as described above and concentrated to 5 ml by ultrafiltration with YMl cellulose 
membranes. The ultrafiltration product was passed through a 0.2 |im filter. 100 |il of the filtrate 
was loaded onto a protamine chloride - POROS column (4.6 x 50 mm) equilibrated with buffer 
A. The flow rate was 10 nfil/min. The column was washed with 10.0 ml of buffer A, and the 
adsorbed firaction was eluted with the same buffer containing NaCl by a stepwise increase of 

15 the NaCl concentration: 6.6 ml buffer A / 0.04 M NaCl; 6.6 ml buffer A / 0,06 M NaCl; 6.6 
ml buffer A / 0.23 M NaCl foUowed by a linear gradient of 0.23 - 1 .20 M NaCl (4.2 ml) 
in buffer A. Fractions of 6.0 ml were collected, assayed for XT activity and evaluated by SDS- 
PAGE. Cyclic chromatography was repeated 50 times. The purified enzyme was collected, 
concentrated to 1 .0 ml using ultrafiltration techniques and stored at - 75 °C. 

20 Step 5: SDS-Polyacrylamide gel electrophoresis (SDS-PAGE) - The protem composition of 
' various firactions was estinMted by SDS-PAGE. Briefly, 12.1 |il of sample was added to 4,7 |il 
of sample buffer (1.00 M Tris/HCl / 1.17 M sucrose, 0.28 M SDS, 2.08 mM EDTA, 0.88 mM 
Serva Blue G250, 0.70 mM phenol red, 0.10 M DTT, pH 8.5) and heated for 10 minutes at 
99**C. After tiie sample had been loaded, SDS-polyacrylamide gel electrophoresis was carried 

25 out on a 4- 12% bis-tris polyacrylamide gel with 3-(N-moiphoIino) propanesulfonic acid 

(MOPS) runnmg buffer (1.00 M MOPS/Tris / 69,3 mM SDS, 20.5 mM EDTA, pH 7.7). Protein 
bands were detected by Coomassie Brilliant Blue or by silver staining. The Coomassie bands 
were excised and characterized by MALDI mass spectrometry and amino acid sequence 
analysis. 

30 Example 6: MALDI mass spectrometry, Coomassie-stained proteins were excised from the gel, 
repeatedly washed with H2O and HaO/acetonitrile and digested overnight with trypsin and 
endoproteinase Lys-C at 37*'C. The peptides generated in the supernatant were analyzed by 
MALDI mass spectrometry. Sample preparation was achieved following the thin fihn 
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preparation techniques (13). Briefly, aliquots of 0.3 jil of a nitrocellulose containing saturated 
solution of a-cyano-4- hydroxycinnamic acid in acetone were deposited onto individual spots 
on the target. Subsequently, 0.8 ^1 10% formic acid and 0.4 \il of the digest sample was loaded 
on top of the thin film spots and allowed to dry slowly at ambient temperature. To remove salts 

5 . from the digestion buffer the spots wa:e washed with 10% fonruc acid and with H2O. 
mass spectra were recorded in the positive ion mode with delayed extraction on a Reflex n 
time-of-flight instrument equipped with a SCOUT multiprobe inlet and a 337 mn nitrogen laser. 
Ion acceleration voltage was set to 20.0 kV, the reflector voltage was set to 21 .5 kV and the 
first extraction plate was set to 15.4 kV. Mass spectra were obtained by averaging 50-200 

10 individual laser shots. Calibration of the spectra was performed internally by a two-point linear 
fit using the autolysis products of trypsin at m/z 842.50 and m/z 2211. 10. 
Example 7; Amino acid sequence analysis of XT. The MW 120.000 Coomassie-stained protein 
was excised firom the gel, repeatedly washed with H2O and HiO/acetonitrile and digested with 
trypsin and endoproteinase Lys-C overnight. For HPLC separation the excised gel fragment 

15 was extracted twice with 0. 1 % TF A/60% acetonitrile. The extracted enzymatic fiagments were 
separated on a capillary HPLC system equipped with a 140B solvents delivery system 
(PE Biosystems), Acurate splitter (LC-Packings), UV absorbance detector 759A 
(PE Biosystems), U-Z c^illary flow cell (LC-Packings) and Probot firaction collector 
(BAI) using reversed-phase colxmm (Hypersil CI 8 BDS, 3 [im, 0.3 x 150 mm) and a linear 

20 gradient firom 12% acetonitrile, 0. 1% TFA to 64% acetonitrile, 0.08% TFA in 90 min with a 
flow rate of 4 jil/min at RT. Peptide elution was monitored at 214 nm and individual fi^ctions 
firom the HPLC separation were analysed by MALDI mass spectrometry. Sequrace analysis 
of separated fragments was performed on a Precise Protein Sequencer 494 cLC using standard 
programs supplied by PE Biosystems. 

25 Example 8: Determination of XT activity. Determination of XT activity is based on the 
incorporation of [ 14 C]-D-xylose into the recombinant bikunin according to a previously 
described method (Brinkmann et al., 1997, J. Biol. Chem., 272, 11171-11175). For analysis of 
the substrate specificity of the recombinant xylosyltransferases synthetic peptides containing 
the XT recognition sequence were used as acceptor. The reaction mixture for the assay 

30 contained, in a total volume of 100 50 jil of XT solution, 25 mM 

4"morpholineethanesulfonic acid (pH 6.5), 25 mM KCl, 5 mM KF, 5 mM MgCli, 5 mM 
MnCh, 1.0 ^M UDP-[''^C]-D-xylose, and 1.5 \iM of the synthetic peptides. After incubation for 
75 min at 37°C, the reaction mixtures were placed on discs (25 mm diameter) of Immobilori- 
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AV membrane, which umnobilizes even small peptides by covalent links (Pfimd & Bourdage, 
1990, Mol Immunol 27, 495-502), and allowed to dry. It was then washed for 10 min with 
10% trichloroacetic acid and three times with 5% trichloroacetic acid solution. Incorporated 
radioactivity was determined by liquid scintillation counting. 

5 Example 9: Antiserum Preparation The syntiietic peptide CSRQKELLKRKLEQQEK 
deduced from the sequenced peptides 2 and 10 of the enzymatically cleaved XT v^as 
synthesized, purified by HPLC and used for immunization of rabbits (Bioscience, Germany). 
Polyclonal antiserum was obtained by injection of the above antigen followed Example 10: 
Preparation of solid-phase antigen, the antigen CSRQKELLKRKLEQQEK was 

10 immobilized on POROS 20 EP. After 1 .6 mg antigen was dissolved in 1 .2 ml of 10 mM 
phosphate / 0.15 M NaCl, pH 7.4 0.60 ml of 100 mM phosphate / 1 .50 M NaCl, pH 7.4 was 
added. 400 mg POROS 20 EP was suspended in the solution and the suspension was mixed 
gentiy on a shaker at RT. At 10-min intervals, five times in total, an additional 0.24 ml of 100 
mM phosphate / 1 .50 M NaCl, pH 7.4 was added to the su^ension. After additional shaking for 

15 5 days at RT tiie suspension was filtered on a sintered glass fimnel. The naatrix was suspended 
in 4 ml 0.2 M phosphate / 0.1 M 2-mercaptoetfaanol, pH 7.4 and mixed on a shak^ for 2 h at 
RT. The matrix was washed in a sintered glass funnel using 20 ml of 10 mM phosphate / 0.15 
M NaCl, pH 7.4 and 20 ml of LO MNaCL After additional washing with 20 ml of 10 mM 
phosphate, pH 7.4, the matrix was packed in a PEEK column (4,6 x 50 mm). 

20 Example 11: Antibody purification. Antiserum was adjusted to 50 mM Tris/HCl, pH 8.0. 
The solution was clarified by passage through a 0.2 \im filter. 0.4 ml filtrate was applied at 10 
ml/min to the antigen column previously equilibrated with buffer B (50 mM Tris/HCl, pH 8.0). 
After the column was washed with 4.1 ml buflFer B and with 12,5 ml buffer B / 0.15 M 
NaCl, the adsorbed antibody was eluted using 12.4 ml of 50 mM sodium citrate / 0.15 M NaCl, 

25 pH 3.0 followed by 3.4 ml of 100 mM sodium citrate / 1.5 M NaCl, pH 3.0. The eluate was 
collected as 1 0 ml fractions in tubes containing 2 ml of 0.5 M Tris/HCl, pH 8.0, to immediately 
neutralize the citric acid. 

Example 12: Preparation of immunoaffinity column. Purified antibody was concentrated 
to a protein concentration of 0.3 mg/ml using ultrafiltration with YMl cellulose membranes. 
30 The antibody solution was adjusted to 1 0 mM phosphate / 0. 1 5 M NaCl, pH 7.4. After filtration 
of the solution through a 0.2 ^im filter 100 ^il filtrate was applied at 0.2 ml/min to a POROS 20 
PA column (2.1 x 30 mm). This step was repeated 17 times. Adsorbed antibody was cross- 
linked using cross-linking solution (100 mM triethanolamine, pH 8.5). After the column was 
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washed with 5 ml of buffer C (10 mM phosphate / 0.15 M NaCl, pH 7.4) 2 ml cross-linking 
solution was applied at O.S ml/min onto the cartridge. The procedure was repeated 6 more 
times, using a total volume of 14 ml cross-lmking solution. To block unreacted functional 
groups on the cross-linking reagents 2 ml of 100 mM monoethanolamine, pH 9.0 

5 (quenching solution), was loaded onto the cartridge at O.S ml/min. The column was 
washed using 2 ml of buffer C and the cross-Unking step was repeated using another 2 
ml quenching solution. The immunoafSnity column was cycled between buffer C and 12 mM 
HCl / 0. 1 5 M NaCl 3 times using a total volume of 12 ml of solution. 
Example 13: Immmoqffimty columnpvrification of XT. XT-containing fractions eluted 

10 from the heparin afBnity matrix were desalted using diafiltration with YMl ceUulose 

membranes and passed through a 0.2 \im filter. 100 |xl of this XT-sample was applied to the 
immunoaifinity column equilibrated with buffer D (20 mM Tris/HCl, pH 8.0) at a flow rate of 1 
ml/min. The column was washed with 1.4 ml of buffer D and with 8.5 ml of buffer D / 0.15 M 
NaCl. The XT activity was eluted with 4.2 ml of 12 mM HCl followed by 1 .2 ml of 12 mM 

15 HCl / 1 .5 M NaCl. Alternatively the elution was performed using 100 jtU of antigen at 1 mg/ml 
in buffer D. Fractions (1 ml) were collected into tubes containing 1 ml of 0.1 M Tris/HCl, 
pH 8.0. The XT activities of the fractions were determined. 

Example 14: Aprotinin affinity chromatography. 200 |iil of desalted XT solution from the 
heparin purification step was applied at 10 ml/min to the aprotinin column previously 

20 equilibrated with buffer A. After washing the colunm with 6.6 ml of buffer A the adsorbed ' 
protein was eluted stepwise using the same buffer containing 0.3 M NaCl (10.0 ml), 0.54 M 
NaCl(lO.Oml), LOOM NaCl (10.0 ml) and 1 .50 M NaCl (2.4 miV Example 15; Western 
Blot Analysis, For western blot analysis, proteins were transferred to polyvinyUdene 
difluoride membrane in a semi-dry instrument (Novex). After transfer nonspecific 

25 antibody binding sites were blocked with 2% BS A in 0. 1 M Tris/HCl, pH 7.2, for 1 h at RT. 
The membrane was incubated with antiserum in 50 mM phosphate / 0.15 MNaCl, 0.5 
ml/1 Tween 20, pH 7.4 at 1:1000 dilution for 1 h. Bound antibody was detected usiag a second 
anti-rabbit goat immunoglobulin coupled to horse-radish peroxidase at a 1:1000 dilution. The 
blot was developed using 4-chloro-l-naphthol. 

30 Example 16: Gel filtration chromatography. A sample of 100 nl from the heparin 

purification step was applied at 1 .0 ml/min to a TSK G3000 S W column (30 cm x 7,5 mm, 10 
jxm particle size) which had previously been equilibrated with buffer A / 0.15 M NaCl. Proteins 
were eluted with the same buffer. Fractions of 200 jil were collected and tested for XT activity. 
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Colmnn calibration was performed using thyroglobulin (669.000), ferritin (440.000), 
aldolase (158.000), albumin (67,000), ovalbumin (43.000). 

Example 17: N-glycosidase F digestion, Aliquots (1 jig) of XT were digested with 3,1 x 
. 10"^ units of N-glycosidase F at 37°C for 1 h and 12 h (Table II). The samples were tiien 
5 subjected to SDS-P AGE, and protein bands were detected by silver staining. . ^ . . 

Example 18: Measurement of protein concentration. Protein concentration was estimated 
by absorbance at 280 mn assuming E i cm = 10.0 or with the Bicinchoninic Acid Protein - 
i4 way using bovine serum albuinin as a standard. 

Example 19: PCR-based cloning of human xylosyltransferase. Degenerate oligonucleotide 

10 primers with deoxyinosine substitution were designed based upon the amino acid sequence of 
peptides obtained after digestion of the isolated human XT with trypsin or Lys-C. The first 
strand of cDNA was synthesized by the reverse transcription reaction using poly(A)"** RNA 
isolated jfrom the chondrosarcoma cell line SW1353 as template and oligo(dT) as primer. The 
reverse transcription reaction was performed at 3TC for 2 h using 50 pmol of oligo(dT) 

15 primers, 1 fig of poly(A)'*' RNA, a 0.5 mM concentration of each dNTP, Ix RT buffer and 200 
units of RNase H deficient Moloney murine leukemia virus reverse transcriptase (Life 
Technologies, Eggenstein, Germany) in a final volume of 20 ^1. For PGR amplification the 
reaction mixture contained 4 |il of the reverse transcription reaction solution, 50 pmol of each 
primer, a 0.25 mM concentration of each dNTP, 50 mM KCl, 10 mM Tris-HCl (pH 8.3), 1.5 

20 mM MgCl2 and 2.5 units of hot start Taq polymerase (Life Technologies) in a final volume of 
50 Amplification with degenerate oligonucleotide primers was carried out by 40 cycles at 
94**C for 1 min, 48*^0 for 1 mm, and 72°C for 2 min, followed by a final extension step at 72*C 
for 15 min. After agarose electrophoresis of the PGR products, DNA fragments were excised, 
subcloned into the pGR2.1 vector (Invitrogen. Groningen, Netherlands) and sequenced by the 

25 dideoxy chain termination method usmg Taq DNA polymerase (Big-dye terminator cycle 

sequencing kit, Perkin-Elmer, Norwalk, CT, USA) with an automated DNA sequencer (Applied 
Biosystems, Weiterstadt, Germany). Several clones were sequenced to compensate for 
misreading by Taq polymerase. 

Example 20: Rapid amplification of 5 ' and 3 * cDNA ends (RACE). For amplification of the 5' 
30 and 3 ' ends of the XT-I and XT-II isoform cDNA RACE experiments were perfomied using 
commercially available systems (Clontech, Heidelberg, Germany; Life Technologies) according 
to the manufacturers' instructions. For 3 '-RACE 1 ng placenta poly(A)"*^ mRNA (Clontech) was 
reverse transcribed with a 3'-CDS primer (Clontech). PGR amplification of the 3' cDNA end of 
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human XT-I was accomplished accordmg to a touch-down PGR protocol (5 cycles: 94°C for 30 
s, 72°C for 3 min, 5 cycles: 94°C for 30 s, 70°C for 45 s, 72^C for 3 min, 25 cycles: 94X for 30 
s, 65°C for 45 s, 72°C for 3 min) with the gene-specific primer GSPlb 5'- 
GTGGGTATGCAGAAGTGGGGGAAGGGAC-3' and the UPM primer mix (Clontech). An 

5 aliquot of the first PGR was subjected to semi-nested PGR (30 cycles) using the primer 
Gon21 1 1 5*-CCGTGCGCAATGCGTACA-3' and the UPM primer mix. The DNA fiagments 
obtained were subcloned into the vector pGR2.1 (Invitrogen) and sequenced, PGR amplification 
of the 3' cDNA end of the XT-II isofonn was carried out with the primers AB10267 5'- 
ACTGAGGTCACGCAATACAA-3' and UPSl usmg the touch-down PGR protocol. For 5'-RAGE 

10 1 ^g of placenta polyCA)"*" mRNA was subjected to a reverse transcription and 5' tailing reaction 
with the 5 '-CDS primer and the SMART oligo (Glontech) according to the manufacturer's 
protocol. The 5' end of the XT-I cDNA was amplified using the primer GSP_776L 5'- 
GCCGCACTCAG-GTGATGAAGAAGT.3' and UPM with a touch-down PGR protocol. An 
aliquot was used as template in a second semi-nested PGR reaction with the primers GSP_503L 

15 5'-ACCA-CCAGGACAAAGGGGATTCTGA-3 ' and UPM. The 5'-cDNA end of XT-E was 

obtained by 5'-RACE amplification using the primers AB10315L 5'-AGTCGAACAGTCCAGG- 
GCCC-3' and UPM mix. A new primer for a 5'-RAGE reaction was designed based upon the 
nucleotide seqxience of the largest firagment obtained (2 kbp), and the experiment was repeated 
with the primer AB5846L 5'.GACGATCTCGCAGTTGGGGG-3' as described above. The 

20 nucleotide sequences of human XT-I and XT-II cDNA have been submitted to the 
GenBank/EBI Data Bank with the accession numbers AJ277441 and AJ277442. 
Example 21: Isolation of XT-I cDNA from human brain and chondrocyte cDNA libraries. 
Plasmid-DNA was isolated firom a human whole brain cDNA library (Life Technologies) and a 
human chondrocyte cDNA library (Glontech) and used as template in a PCR-based approach 

25 for isolation of the 5' end of the XT-I cDNA. Briefly, GSP_776L and 5'ADLD 5'- 

CTATTCGATGATGAAGATACCCCACCAAACCC-3' primers were used in a PGR reaction with 
1 |ig of plasmid DNA as template. The DNA firagments were subjected to agarose gel 
electrophoresis and 0.7-3 kb long firagments were excised from the gel and used as a template in 
a semi-nested PGR reaction with GSP_503L and 5'-ADLD primers. DNA fragments were then 

30 subcloned into the vector pGR2.1 and sequenced. 

Example 22; Cloning of rat XT-I and XT-II cDNA, A RT-PGR based approach using primers 
based upon the nucleotide sequence of human XT-I and XT-II was employed for amplification 
of XT-I and XT-II cDNA from rat. The first strand of cDNA was synthesized by the reverse 
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transcription reaction using polyCA)"^ UNA isolated &om the rat liver cell line BRL3A as 
template and oligo(dT) as primer. DNA fragments obtained after PGR amplification using 
moderate stringent conditions were subcloned into the vector pCR2.1 and sequenced. The S' 
- and 3 ' ends of the XT-I and XT-II cDNA from rat tissue were amplified using the RACE 
5 strategy with gene-specific primers as described above. The nucleotide sequences of rat XT-I 
and XT-n cDNA have been submitted to the GenBank^BI Data Bank with the accession 
numbers AJ295748 and AJ295749. 

Example 23; Expression levels of the XT-I and the XT-U, isoform in human tissues. 

A Human Multiple Tissue cDNA Panel (Cfontech) was uised for the analysis of expression 

10 levels. Levels of amplification of the housekeeping gene glyceraldehyde-3-phosphate 

dehydrogenase, whose transcript is always present in the tissues at an almost constant level, 
were determined in parallel for quality control For amplification of the XT-I encoding 
transcript the primers 128U and 601L were used, whereas primers AB10267U and AB12394L 
5'-GGAAGAGCTGGGTGTGGAAT-3' were employed for the XT-II. PGR reactions were carried 

15 out by 22-36 cycles at 94T for 30 s, 60-65°C for 45 s and 72°C for 2 nun. Amplification of a 
transcript was performed using a serial number of cycles to find the conditions for 
semiquantitative amplification, and aliquots were analyzed by agarose gel electrophoresis. 
Example 24; Transfection and transient expression of XT, For construction of a eukaryotic 
expression vector a DNA fragment including the known coding sequence of XT-I cDNA was 

20 amplified by PGR using XT__ExplL 5'.TTTCCCGTTGAGATCCTGCT.3' and XT_Exp3U 5'- 
ACAGACAGCAACAACGAGAA-3' as primers and placenta first-strand cDNA (Clontech) as 
template. The 2450 bp fragment obtained was cloned into the vector pcDNA4/HisMax-T0P0 
(Invitrogen) resulting in the fiision of XT-I to i}a&Xpress epitope. The plasmid pCG227-XT-I 
was then transiently transfected into CHO-Kl cells. The coding region of the XT-II cDNA was 

25 amplified by PGR using the primers AB_E3q)9U S*-AAAGGAAGGCAGAGGAAGC-3' and 
AB_Exp3L 5'-ACCCCTCCACTGT-CTGTAAG-3' and placenta first-strand cDNA as template. 
The obtained 2440 bp DNA firagment was cloned into the expression vector pcDNA4/HisMax- 
TOPO resulting in the fusion of XT-II to the aminotenninal J^yrejj epitope. The plasmid 
termed pCG226-XT-II was transiently transfected into CHO-Kl cells. 2x10^ cells precultured 

30 for 1 day in a 35 mm diameter cell culture dish were transfected with 2 |ig of plasmid DNA and 
6 jil of Fugene 6 transfection reagent (Roche, Mannheim, Germany). For determination of 
transfection efficiency CHO-Kl cells were transfected with 2 ^ig of the control plasmid 
pcDN A4/HisMax-T0P0-/acZ. 48 h after transfection the cell culture medium was harvested 
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Protein G s^arose beads (Sigma, Deisenhofen, Germany) and mouse aaldrXpress monoclonal 
antibody (Invitrogen) were added to the cell culture supernatant and incubated at 4^C for 1 h. 
After centrifiigation at 10.000 g for 1 min the absorbed proteins were twice washed with PBS 
and resuspended in a final volume of SO |il. XT activity was then assayed in the samples. 

5 Example 25; Characterization of the reaction products. For characterization of the reaction 
product of recombinant XT-I the peptide QEEEGSGGGQK, which is homologous to the amino 
terminus of bikunin (Brinkmann et al, 1997, I.e.), was used as acceptor in the XT activity 
assay. After incubation for 75 min at 37®C the enzyme was heat-inactivated by incubation for 
1 5 min at 65**C and the reaction mixture was used for a- and p-xylosidase treatment and 

10 alkaline p-elimination. For the linkage-specific digestion of the bound ['"^CJ-D-xylose 4 mil of 
a-xylosidase (Seikagaku Corporation, Tokyo, Japan) or 4 mU of P-xylosidase (Sigma, 
Dreieich, Gennany) were added to the samples and incubated for 60 min at 3TC. The reaction 
mixtures were then placed on Immobilon- AV membrane discs and allowed to dry. The discs 
were washed with trichloroacetic acid as described above and the remaining uicorporated 

IS radioactivity was deteimined against appropriate controls. The alkaline cleavage of the 0- 
glycosidic linkaige between the [^"^CJ-D-xylose and the p-hydroxyamino acid serine was 
performed as described elsewhere (Montreuil et al., 1994). Briefly, the reaction mixture was 
adjusted to pH 10 with diluted NaOH and an equal volume of cold, freshly prepared 
sodiumboro-hydride solution (2 M sodiumborohydride in 0.1 M NaOH) and was added. After 

20 iacubation at 4S°C for 16 h the cooled solution was neutralized by adding 50% acetic acid and 
placed on Immobilon- AV membrane discs. After drying the discs were washed witii 
trichloroacetic acid and the remaining radioactivity was measured by liquid scintillation 
counting. 
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Patent Claims 

I . An isofonn of UDF-D-xyloscproteogfycan core protein JS-D-xylosyltransferaseQiT). 

S 2. A protein comprising an amino acid sequence of the isoform of claim 1 or a portion thereof 
having the biological activity of XT. 

3. A protein of claim 1 or 2 deriving from human or rat sources (hXT, rXT) 

10 4. An isolated protein according to claim 3, wherein said hXT has a molecular weight of 
120.000 under SDS polyacrlylamide gel electrophoresis conditions. 

5. A recombinant protein of claim 3, wherein said protein is hXT-I comprising at least 827 
amino acids. 

15 

6. A protein accordiiig claim S, having the amino acid sequence-as depicted in Fig. 7B. 

7. A protein according to claim 3, wherein said protein is hXT-II comprising 865 amino acids 
and is an isofonn of hXT-I. 

20 

8. A protem according to claim 7 exhibitmg approximately 55% overall sequence identity to 
hXT-I 

9. A protein according claim 8, having the amino acid sequence as depicted in Fig. 8B. 

25 

1 0. A DNA sequence coding for a protein of any of the claims 1-9. 

I I . A DNA sequence according to claim 10 comprising the nucleotide sequence coding for 
hXT-I as depicted in Fig. 7A, or rXT-I as depicted in Fig. 9. 

30 

12. A DNA sequence according to claim 10 comprising the nucleotide sequence coding for 
hXT-U as depicted in Fig. 8A or rXT-II as depicted in Fig. 1 0. 
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13. An expression vector comprising a promotor sequ^ce, a DNA sequence of claim 10 and 
optionally a signal sequence. 

14. An expression host cell comprising a vector of claim 13, said host cell being capable of 
expressing a protein of any of the claims 1 - 9. ^ 

15. An antibody directed against a protein as defined in any of tiie claims 1- 9. 

16. A process for isolating and purifying a protein as defiaed in claims 1-4, characterized by 
the following steps: 

(i) culturing cells having an increased level of XT, and harvesting the 
supernatant of said cell culture, 

(ii) firactionated ammonium sulfate precipitation of the supematant of step (i), 

(iii) heparin affinity chromatography of the precipitate of step (ii), 

(iv) ion exchange chromatography of the step (iii) product, 

(v) affinity chromatography of the step (iv) product, and 

(vi) SDS-Polyacrylamide gel elektrophoresis of step (v). 

17. A pharmaceutical composition comprising a protein of any of the claims 1-9 and a 
pharmacologically acceptable carrier, diluent or excipient 

1 8. Use of a protein of any of the claims 1 - 9 for the manufecture of a medicament for the 
treatment of sclerotic diseases and chronic inflammatory joint diseases. 

19. Use of claim 1 8, wherein said medicament is an inhibitor or antagonist of said proteiiL 

20. Use of a protein of any of the claims 1 - 9 as diagnostic marker. 

21. Use of a DNA molecule of any of the claims 10 - 12 as gene marker. 

22. Use of an antibody as defined in claim 15 in an immunological assay for determination of 
protein having the biological activity of hXT as diagnostic tool. 
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Fig. 2 
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Fig. 4 
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Fig. 5 
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SPPSl 5'-ATH GCI AGY CCI CCI AGY GA-3' 

Peptide 4 lie Ala Ser Pro Pro Ser Asp Phe Gly Arg 

Inv2b 5 '-TAD CGN AGI GGN GGI AGI CT-3' 

Inv2c 5»-TAD CGlSf TCR GGI GGI tCR CT-3' 

DFl 5 '-GAY TTY GAR AAY GTI GAY AA-3' 

P^txcle 6 Asp Pile Glu Asn Val Asp Asn Ser Asn Plie Ala Pro Arg 



PFF-sense 5' -CCI ACl TTY TTY GCN CG-3' 

Peptide 7 Pro Thr She Pihe Ala Arg 

Peptide 9 Glu Xieu Gly Ala Val Jdys Pro Asp Gly Arg lieu 

Lysc-invl 3'-CCN CGI CAN TTY GGN CTR CC-5' 
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Fig- 7A 

ACTCAGGATGGCTACTTTTCTCATCGGCCGAAAGAGAAAGTGCGAACAGACAGCAACAACGAGAACTCT 
GTCCCCAAAGACTTTGAGAATGTGGACT^CAGCAACTTCGCACCCAGGACTCAAAAGCAGAAGCACC^^ 
CCTGAGTTGGCGAAGAAGCCACCGAGTAGACAGAAGGAGCTTTTGAAAAGGAAGCTGGAACTVGCAGGAG 
AAAGGAAAAGGACATACATTCCCTGGGAAAGGCCCCGGTGAGGTGCTGCCTCCCGGGGACAGAGCCGCA 
GCCAACAGCAGCCACGGGAAGGATGTGTCCAGACCGCCTCATGCCAGGAAAACTGGGGGCAGCTCCCCC 
GAGACCAAGTATGACCAGCCCCCTAAGTGTGACATCTCAGGCAAGGAGGCCATCTCTGCCCTGTCCCGT 
GCTAAGTCCAAGCACTGCCGCCAGGAGATTGGGGAGACTTACTGCCGCCACAAGTTAGGGCTGCTGATG 
CCTGAGAAGGTGACTCGGTTCTGCCCCCTCGAGGGTAAAGCCAACAAGAACGTGCAGTGGGACGAGGAC 
TCCGTGGAGTACATGCCAGCCAACCCGGTCAGAATCGCCTTTGTCCTGGTGGTCCACGGCCGTGCCTCT 
CGGCAGTTGCAGCGCATGTTCAAGGCCATCTACCACAAAGACCACTTCTACTACATCCACGTGGACAAG 
CGCTCTAATTACCTGCATCGGCAAGTGCTCCAGGTCTCCAGGCAGTACAGCAATGTCCGCGTCACCCCC 
TGGAGAATGGCCACCATCTGGGGAGGAGCCAGCCTCCTGTCCACCTACCTGCAGAGCATGCGGGACCTC 
CTGGAGATGACCGACTGGCCCTGGGACTTCTTCATCAACCTGAGTGCGGCCGACTACCCCATCAGGACA 
AATGACCAGTTGGTGGCGTTTCTCTCCCGATACCGAGATATGAATTTCTTGAAGTCACACGGCCGGGAC 
AATGCAAGGTTCATTCGGAAGCAGGGCCTGGATCGGCTCTTCCTGGAGTGCGACGCTCACATGTGGCGC 
CTGGGAGATCGGCGGATCCCAGAGGGCATTGCCGTGGATGGCGGTTCGGACTGGTTCCTGCTGAACCGG 
AGGTTTGTAGAATATGTGACCTTCTCCACAGACGATCTGGTGACCAAGATGAAACAGTTCTACTCCTAC 
ACCCTGCTTCCTGCTGAGTCCTTCTTCCATACGGTCCTGGAGAACAGCCCCCACTGCGACACCATGGTG 
. GACAACAACCTGCGCATCACCAACTGGAATCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCACATCGfTG. 
GACTGGTGCGGCTGCTCCCCCAATGACTTCAAGCCGCAGGACTTCCACCGCTTCCAGCAGACAGCCCGG 
CCTACCTTCTTTGCCCGCAAGTTTGAAGCCGTGGTGAATCAGGAAATCATTGGGCAGCTGGACTATTAC 
CTGTACGGGAACTACCCTGCAGGTACCCCGGGCCTGCGCTCCTACTGGGAGAATGTCTACGATGAGCCT 
GACGGCATCCACAGCCTGAGCGACGTGACACTCACCTTGTACCACTCCTTTGCCCGCCTGGGTCTTCGA 
CGGGCTGAGACGTCCCTGCACACGGATGGGGAGAACAGCTGCCGATACTACCCAATGGGCCACCCAGCA 
TCTGTGCACCTCTACTTCCTTGCTGACCGCTTCCAGGGCTTTCTGATCAAGCATCATGCTACCAATCTG 
GCTGTGAGCAAACTAGAGACTCTGGAGACCTGGGTGATGCCGAAAAAAGTCTTCAAGATCGCAAGCCCA 
CCCAGTGACTTTGGGAGGCTTCAGTTTTCCGAGGTCGGCACTGACTGGGATGCCAAGGAGAGGCTATTC 
CGCAACTTTGGGGGTCTTCTGGGGCCCATGGATGAGCCGGTGGGTATGCAGAAGTGGGGGAAGGGACCT 
AATGTGACCGTGACCGTCATTTGGGTGGATCCCGTCAATGTCATCGCAGCCACCTACGACATCCTCATT 
GAGTCCACTGCCGAATTCACACACTACAAGCCCCCTTTGAACTTGCCCCTGAGGCCTGGGGTCTGGACA 
GTGAAAATTCTCCACCACTGGGTGCCAGTTGCAGAGACCAAATTCCTCGTTGCGCCTCTGACCTTCTCG 
AACAGGCAGCCCATCAAACCTGAGGAGGCACTGAAGCTGCACAATGGGCCCCTCCGCAATGCCTACATG 
GAGCAGAGCTTCCAGAGCCTATU^CCCCGTCCTCAGCCTGCCCATCAACCCCGCCCAGGTGGAACAGGCA 
CGGAGGAACGCAGCCTCCACGGGCACAGCGCTGGAGGGATGGCTGGACTCGTTGGTGGGCGGGATGTGG 
ACTGCCATGGACATCTGTGCCACGGGCCCCACAGCCTGCCCGGTCATGCAGACCTGCAGCCAGACGGCC 
TGGAGCTCCTTCAGCCCTGACCCCAAGTCGGAGCTGGGTGCCGTCAAACCTGATGGCCGGCTCAGGTAG 
CACTGGGCACGAGGAGTGGGCCACAGCAGGATCTCAACGGGAAAGCAGCCAGAGGGGTTGTGGGGCCTG 
AACCCCGGCCTCCACCCTGGGGGAGGCCCTCTGTGAATGGGTCTCTCCTGGCCATAGAATGATGGAAAG 
GGAAGGTCAGCAGGTCAAAGCAGGATCAGCCAACAACCTGCCTTTGGCAAGCTGCGGGTGGGATGGCTC 
AGTCCCTGCACTGTGACTGTCTCACCTCTTCTGSTTGATCCTCAAGTCCTACAGGTTCCTTGTCTTCCC 
CTTCCAGTGACCCACCCCTGACCCCAGACGTGTGATTTTCAGACTTTTCTTTCGAGCAGCAGAACTTCG 
TTTACGGAGCACAGTCATAAGTGRAGGTTCAGGGTGCTGACGAAATCCAAGCTGCTCTGGTTGAAGCTG 
ACAAGTGCGAGGTTCCCTCCCAAAGCTCAGCCCTCTGGGCGGTCCCCTTGCCCAGGGTATCTCCTACGG 
TACCTCTTCAGAACCCAAGGGCTCTGCAAATGCCAGTTTGACAAGCACTGCCCAGACCAACCATGGGTT 
CAAACTCCAGCCCTGCCCTTTGGTTCATTTTTCTGCTTCTCTTGGCTGGGGGACTCTGGTGCCAGCCTT 
GAAAGTCATGGTCGTGGGCCCTTTCCCATGGAGGCTGCAGCCTTAGGAGAGCTCTGAGCCTCTCAGCAG 
CCCTCCTTGGGTTGAACTATTCTCCTTAGTAACTAGGTAAGTGGGAAAGCCTTTTGATGTGGCATGGCC 
AAGGTCCAGCCACAAGTGCAACTGCCACCTGTCCAGGGGTCTGGGCCTCCTTCCCTCAAGGCTGCCACA 
CAAAGTAGCAGAAATAGGATGATGTTTGTGAGCACCAGACTCAAGACCATGACCTTCTTTGATCCTTGA 
AAATGGGAACTTTGACAGCCATGACCATGAAACTCACAAGGCAACGCGGATGAAACTCACAAAGCAATG 
CTTGGAGCAAAACTCCTGAGCTAGACAGCACAGCAGCACCCATCCCCTGCCAGAGCCCTTCCGTTTGAG 
GTCAGACACACAAAACCTTCGTCAATTGCACACCGGTGCTGTTGGGAGTGACCAAACCACATGAACCAG 
ACTTTTCCCGTCCAGGAAATAGCATTTCAGATTTGGTTTTTAATTTCATGCCCTTCGGCCACAGGCTCA 
ACGGGACATGCAACATAAAAATGGGAAGGTTATTCAACCCGCGGGAAAAAAAAAAAAAAAAAAAAAAAA 
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Fig.7B 
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Fig>8A 

CGCAGGCCCCGCCCCCGGCCCCGCCCCGGCGCGCGCCCGGCCCGCCTCCCTAGGCGTGGAGGAGGGGGGG 

CGGCTCAGCCCCGCGCCCGTGCGCGCGCTCGCGGCCGGGTTGCAGGGCTGGGCGCGCGCCCCGCGTCCCG 

GGCAGGAAGATGGTGGCGAGCGCGCGAGTGCAGAAGCTGGTGCGGCGCTACAAGCTGGCGATTGCCACGG 

CGCTGGCCATCCTGCTGCTGCAGGGCCTGGTAGTGTGGAGCTTCAGCGGCCTGGAGGAGGACGAGGCGGG 

CGAGAAAGGAAGGCAGAGGAAGCCACGGCCACTGGACCCTGGCGAAGGTTCCAAGGACACAGACAGTTCA 

GCAGGGCGACGGGGCAGCACAGGCAGAAGGCATGGGCGCTGGCGGGGCCGTGCTGAGAGCCCAGGAGTGC 

CCGTGGCCAAGGTGGTACGGGCAGTAACCAGCCGGCAGAGAGCCAGCCGGCGGGTCCCACCTGCCCCACC 

TCCGGAAGCCCCAGGCCGCCAGAACCTGAGTGGGGCAGCAGCTGGGGAGGCGCTGGTAGGGGCAGCTGGC 

TTCCCACCACACGGAGATACAGGGAGCGTGGAGGGCGCCCCCCAGCCCACGGACAATGGCTTCACCCCCA 

AGTGCGAGATCGTGGGCAAGGACGCACTGTCTGCACTGGCCCGGGCCAGCACCAAGCAGTGCCAGCAGGA 

GATCGCCAATGTGGTGTGCCTGCACCAGGCTGGGAGCCTCATGCCCAAGGCTGTGCCCCGGCACTGTCAG 

CTGACTGGGAAGATGAGCCCCGGCATCCAGTGGGATGAGAGCCAAGCCCAGCAGCCCATGGATGGCCCCC 

CGGTGCGAATCGCCTACATGCTGGTGGTTCACGGCCGCGCCATCCGCCAGCTGAAGCGTCTCCTCAAGGC 

CGTTTATCACGAGCAGCACTTCTTTTACATCCATGTGGACAAGCGTTCCGACTACCTGCACCGGGAGGTG 

GTGGAGCTGGCCCAGGGCTATGATAACGTGCGGGTGACGCCCTGGCGCATGGTTACCATCTGGGGCGGGG 

CCAGCCTCCTGACGATGTACCTGCGGAGCATGCGGGACCTGCTAGAGGTGCCTGGCTGGGCCTGGGACTT 

CTTCATCAACCTCAGTGCCACTGACTATCCAACCAGGACCAATGAGGAGCTGGTGGCATTCCTATCCAAG 

AACCGGGACAAGAATTTCCTCAAGTCACATGGCCGGGACAACTCCAGGTTCATCAAGAAACAGGGCCTGG 

ACCGGCTCTTCCATGAGTGCGACTCACACATGTGGCGCCTGGGCGAGCGGCAGATCCCAGCAGGCATTGT 

GGTGGATGGCGGTTCTGACTGGTTCGTGCTGACACGCAGCTTTGTGGAGTATGTGGTGTACACAGATGAC 

CCGCTTGTGGCCCAGCTGCGCCAGTTCTACACATACACACTGCTCCCAGCCGAGTCCTTCTTCCACACGG 

TGCTGGAGAACAGCCTGGCCTGTGAGACCCTCGTGGACAACAACCTGCGGGTCACCAACTGGAACCGCAA 

GCTGGGCTGCAAGTGCCAGTACAAGCACATTGTGGACTGGTGTGGCTGCTCCCCCAACGACTTCAAGCCA 

CAGGACTTCCTCCGGCTGCAGCAAGTCTCCAGACCCACCTTCTTCGCCCGGAAGTTCGAGTCGACTGTGA 

ACCAGGAGGTGCTGGAAATCCTGGACTTCCACCTGTACGGCAGCTACCCCCCCGGCACGCCAGCCCTCAA 

GGCCTACTGGGAGAACACCTACGACGCGGCTGATGGCCCCAGTGGGCTCAGTGATGTCATGCTCACTGCT 

TACACAGCCTTCGCCCGCCTCAGCCTGCACCATGCCGCCACTGCTGCACCCCCAATGGGCACCCCACTCT 

GCAGGTTTGAGCCCAGGGGCTTGCCGTCCAGCGTGCACCTGTATTTCTATGACGACCATTTCCAGGGCTA 

CCTGGTGACGCAGGCGGTGCAGCCCTCAGCCCAGGGGCCGGCAGAGACGCTTGAGATGTGGCTGATGCCC 

CAAGGGTCGCTGAAGCTGTTGGGGCGCAGTGACCAGGCCAGCCGGCTCCAGAGTCTGGAGGTTGGCACTG 

ATTGGGACCCCAAAGAGCGTCTTTTCCGGAACTTTGGGGGGTTACTGGGGCCGCTGGACGAGCCTGTGGC 

CGTGCAGCGCTGGGCCCGGGGCCCCAACCTCACAGCCACAGTGGTCTGGATCGACCCAACCTATGTGGTG 

GCCACATCTTATGACATCACAGTAGATACGGAGACTGAGGTCACGCAATACAAGCCCCCACTGAGCCGGC 

CCCTGCGGCCAGGGCCCTGGACTGTTCGACTCCTTCAGTTCTGGGAACCGCTGGGTGAGACCCGCTTCCT 

TGTGCTGCCCTTGACCTTCAACCGCAAACTACCTCTCAGGAAAGATGATGCCAGCTGGCTGCACGCAGGG 

CCACCCCACAACGAGTACATGGAGCAGAGTTTCCAGGGCCTGAGTAGCATCCTGAACCTGCCTCAGCCGG 

AGCTCGCGGAGGAGGCTGCCCAGCGGCACACACAGCTCACAGGCCCTGCGCTCGAGGCCTGGACAGACAG 

GGAACTGAGCAGCTTCTGGTCCGTGGCTGGACTGTGTGCCATAGGCCCCTCTCCCTGCCCCTCCCTGGAG 

CCCTGCAGACTGACCAGCTGGAGCTCTCTGTCCCCCGACCCCAAATCAGAGCTGGGGCCTGTCAAAGCAG 

ACGGGCGACTCAGGTAGCAGGGCCCCAGCCAGTACCCGTGGAGGACCCGGGAAATTGCACCTTACAGACA 

GTGGAGGGGTGTCCCCTCCCACy\GGCAAGAACCAGAGGCCCAGGCTGCACACCCATTTCAGCCA^ 

ACCCACACAGACGGCAGGGAAGGTGGACACAGTATGAACTACTGCTGATGTCTCTGTTGGGGATCAGAGG 

GCTGGCGGGAACGCGAGAAGGGCACCAGCAGCATTCCACACCCAGCTCTTCCTCACCTTCCTGTCTAGTT 

TGAATTTCTTTTTTTTCTTTCTTTTTTTTTTTTTTTTAATTTAAAAAGGAAAATGGGTGGTTGGGAGAGA 

AACTAGAACAGAAGATGTGCAATAGGGCTCAGAGCAGCCCCAGGAAGTGGGCCATGTCTGTGGGGAGCAG 

TGGCTTGCTGAGCCCACCTGCTATTGTTCTGCACGAGGCTGGGCCTGCCTCACTCTCCAGGGCCTCATGC 

CCCATTCTGGGCCTGTGGTGCTCGTGGCTGAGGCTCCACAGGGCTGCAAGTGCCCTGCCAGGCTCTAAGG 

CCCGAAGAACAGGTGATATCGGGGGCGCTGCAGAGCTGGGCTCTAGCAGGACAGTTCTTTTGTAGCCAGG 

GGACCCTAGAAGTGGGGTGGGGCGGCTCCAGGGCTTTCCAGCATACCCTGCCCCTGGATGGGAAAGGCAG 

GGTCAGGGCCCACTGAGACCCATGCAGAGTTTCCAGGTGTGCACTGGAAGTGAGGTCACATGAGCAGCG^ 

GGGAAGAAGACTCTGTCAAGACTCTCAGAAGAACCTGGAGTAATTGTGCCTGAAGCTCAGCGTGAAGTCT 

GAAATATGCAACAGAAGAAATATATCTCTATCTCTCTA 
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Fig, 9 



TACTTGTCTCACCGGCCGAAAGAGAAAGTTCGGACCGACAGCAACAATGAGAACTCAGTCCCCAA 
GGACTTTGAGAATGTGGACAACAGCAACTTCGCACCCAGGACTCAAAAGCAGAAGCACCAGCCAG 
AGTTGGCAAAGAAGCCCCTCAGCAGGCAGAAAGAGCGTTTGCAGAGAAAGCTGGGCGCCCAGGAC 
AAAGGACAGGGGCAGTCAGTCCTAGGAAAAGGCCCCAAGGAGGTGCTGCCTCCTCGGGATVTVAAGC 
CCCAGGCAACAGTAGCCAAGGGAAGGATCTCTCAAGACACAGCCATAGCAGGAAGAGTGGTGGCG 
GTGGGTCCCCGGAAACTAAGTCTGACCAGGTCCCCAAGTGTGACATCTCTGGCAAGGAGGCCATC 
TCAGCGCTGACCCGCGCTAAGTCCAAGCACTGTCGCCAGGAGATTGCAGAAACCTACTGTCGCCA 
CAAGCTGGGGCTGCTGATGCCAGAGAAGGTGGCTCGATTCTGTCCCCTGGAAGGCAAAGCCAACA 
AGAATGTCCAGTGGGATGAGGATGCTGTTGAGTACATGCCCCCCAACCCAGTCAGAATCGCCTTT 
GTCCTGGTGGTCCATGGCCGTGCCTCTCGACAGCTGCAGCGCATGTTTAAGGCCATCTACCACAA 
AGACCATTTCTACTATATCCATGTGGATAAGCGTTCCAATTACCTGCATCGGCAAGTGCTCCAGT 
TCTCCAGGCAGTACGACAACGTCCGAGTCACTTCCTGGAGGATGGCCACCATTTGGGGCGGAGCC 
AGCCTCCTGTCCACTTACCTGCAGAGCATGCGGGATCTACTGGAGATGACTGACTGGCCCTGGC3A 
CTTCTTCATCAACCTGAGTGCGGCTGACTACCCCATCAGGACAAATGACCAGCTCGTAGCATTTC 
TTTCCAGATACCGAGATATGAATTTCCTGAAGTCACATGGTCGGGACAATGCAAGGTTCATCCGG 
AAGCAGGACCTGGACCGTCTCTTCCTGGAGTGTGATACACACATGTGGCGCCTGGGGGACCGGCG 
AATCCCAGAGGGCATTGCTGTAGATGGCGGCTCTGATTGGTTCCTGCTAAACCGGAAGTTTGTAG 
AGTATGTGGCATTCTCCACAGATGACCTGGTGACCAAGATGAAGCAGTTCTACTCTTATACCCTC 
CTCCCTGCTGAGTCCTTTTTCCACACGGTCCTGGAGAACAGCCCCCACTGTGACACCATGGTGGA 
TAACAACCTGCGCATCACCAATTGGAACCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCATATTG 
TGGACTGGTGTGGCTGCTCTCCCAATGACTTCAAGCCTCAGGACTTCCATCGCTTCCAGCAGACG 
GCCCGGCCCACCTTCTTTGCCCGAAAGTTCGAAGCCATAGTGAACCAGGAAATCATTGGGCAGTT 
GGACTCTTACCTGTATGGCAACTATCCTGCGGGCACCCCGGGCCTCCGCTCTTACTGGGAGAATG 
TCTATGACGAACCAGATGGCATCCAGAGCCTCAGCGATGTGGCACTCACCATGTATCATTCCTTC 
ATCCGCCTGGGTCTTCGAAGGGCTGAGTCATCGCTACACACGGATGGGGAGAACAGCTGCAGGTA 
CTATCCAATGGGCCACCCAGCGTCTGTCCACCTCTACTTCCTGGCTGACCGATTCCAGGGCTTTC 
TGATCAAGCACCATGTGACCAACCTTGCTGTGAGCAAACTGGAGACACTGGAGACATGGATGATG 
CCAAAGAAAGTCTTCAAGGTCGCAAGTCCCCCCAGTGACTTTGGGAGGCTTCAGTTTTCTGAGGT 
TGGCACTGACTGGGATGCCAAGGAGAGACTATTCCGGAACTTTGGTGGTCTTTTGGGGCCCATGG 
ATGAGCCGGTGGGAATGCAGAAATGGGGGAAGGGACCCAATGTGACCGTGACTGTTATTTGGGTG 
GATCCTGTCAATGTCATCGCAGCCACCTATGATATCCTGATCGAGTCCACTGCGGAATTCACACA 
CTACAAACCCCCTTTGAATCTGCCTCTGAGGCCTGGTGTCTGGACAGTGAAGATTCTCCATCACT 
GGGTACCAGTTGCAGAGACCAAATTCCTTGTCGCACCTCTGACCTTCTCCAACAAGCAGCCCATC 
AAACCAGAGGAGGCTTTGAAGCTGCACAATGGGCCACCTCGCAGTGCCTACATGGAGCAGAGTTT 
CCAGAGCCTAAACCCAGTCCTCAGCCTGCACATCAATCCTGCCCAAGTGGAGCAGGCCCGGAAGA 
ACGCAGCCTTCACTGGGACAGCACTAGAAGCCTGGCTGGTGGGTGGGACTTGGACTGCCATGGAC 
GTCTGTGCCACAGGCCCCACTGCCTGCCCAGTCATGCAAACCTGCAGCCAAACAGCCTGGAGTTC 
CTTCAGCCCTGATCCCAAGTCAGAGCTGGGTGCAGTGAAGCCTGATGGTCGGCTCAGGTAGCACT 
GGCCACATGGGACTGATACAGCCTGAACCCTGGCCTCTGATCTTATATGGGGGAGGGGTTCCTTC 
TGTTCAAGGACCTCTGTTGCCTGTAGGACAGTGGAAAGAGAGGATTTGGAGGCCAGA 
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Fig, 10 

GCGCTCGCGGCCGGGTTGCAGGGCCGGACGCGCGCCCCGCGTCCCGGGCAGGAAGAT6GTGGCGA 

GCGCGCGGGTGCAGAAGCTGGTGCGGCGCTACAAGCTGGCGATCGCCACCGCGCTGGCCATCCTG 

CTGCTGCAGGGCCTGGTGGTGTGGAGCTTCAGCGGCCTGGAGGAGGACGAGCCGGGCGAGAAAGG 

AAGGCAGAGAAAGCCACGGCCGCTAGACCCTGGAGAGGGTTCCAAGGACACGGACAGCTCTGCAG 

GGCGTAGGGGCAGTGCTGGGAGAAGGCACGGACGGTGGCGGGGCCGCGCCGAGAGCCCAGGTGTG 

CCTGTGGCCAAGGTGGTGCGGGCAGTAACCAGTCGGCAGAGAGCCAGCCGACGGGTGCCCCCTGC 

CCCACCTCCAGAGGCACCAGGCCGCCAGAACCTGAGTGGAGCAGCAGCCGGGGAAGCACTGATAG 

GTGCCGCCGGCTTCCCACAACATGGAGACACAGGGAGTGTGGAGGGTGCCCCTCAGCCCACGGAT 

AACAGCTTCACTCCGAAGTGTGAGATTGTGGGCAAGGATGCACTGTCAGCACTGGCCCGGGCCAG 

CACCAAGCATTGTCAACAAGAGATTGCTAATGTAGTGTGCCTGCACCAAGCTGGGAACCTT^TGC 

CCAAGTCTGTGCCCCGGCACTGCCAGCTGGCTGGCAAGGTGAGCCCTGGCATCCAGTGGGAAGAG 

GTCCGGGCCCAGCAGCCTGTGAGTGGCCCTCTGGTACGCATCGCCTACATGCTGGTGGTTCACGG 

CCGTGCTGTGCGCCAGCTGAAGCGTCTTCTGAAGGCCGTCTACCACGAGGAGCACTTCTTTTATA 

TTCATGTGGACAAGCGTTCCAACTACCTGTACCGGGAGGTGGTAGAGCTGGCCCAGCACTACGAC 

AATGTACGGGTAACACCTTGGCGCATGGTCACCATCTGGGGTGGGGCTAGCCTTCTGAGGATGTA 

CCTGCGGAGCATGAAGGACCTTCTGGAGACTCCTGiSCTGGACCTGGGACTTCTTCATCAACCTGA 

GCGCTACTGACTATCCAACCAGGACGAATGAGGAGCTGGTAGCGTTCTTATCCAAGAACCGGGAC 

AAGAATTTCCTCAAGTCACACGGGCGAGACAATTCCAGGTTCATCAAGAAACAAGGCCTGGACCG 

GCTTTTCCATGAGTGTGATTCCCACATGTGGCGCCTGGGTGAACGGCAGATCCCGGCAGGCATCG 

TGGTGGATGGTGGCTCTGACTGGTTCGTGCTGACACGCAGCTTTGTGGAATATGTGGTGTATACA^ 

GAGGATCCTCTGGTGGCCCAGCTTCGCCAGTTCTATACATACACATTGCTTCCAGCCGAGTCCTT 

CTTCCACACAGTGCTGGAGAACAGCCCAGCCTGTGAGAGCCTAGTGGACAACAACCTGCGGGTTA 

CCAACTGGAACCGGAAGCTGGGCTGCAAGTGCCAGTACAAGCACATCGTGGACTGGTGTGGCTGC 

TCCCCCAACGACTTCAAGCCACAGGACTTCCTGCGGCTTCAGCAAGTCTCCAGACCCACCTTCTT 

TGCCCGGAAGTTTGAGTCGACTGTGAACCAGGAAGTCCTGGAAATTTTGGACTTCCACCTGTATG 

GCAGCTACCCACCCGGCACCCCAGCCCTCAAGC5CCTACTGGGAGAACATCTACGACATGGCCGAT 

GGCCCTAGTGGACTCAGCGATGTCCTACTCACTGCTTACACAGCCTTTGCCCGTATCAGTCTGCG 

TCATGCTGCCACTGTTTCCCCACTGGCCACTGCAGTCTGCAGGTTTGAGCCCAGGGGGTTGCCGT 

CCAGCGTGCACCTGTATTTCTATGACGACCATTTCCAGGGCTACCTGGTGACGCAGGCAGTGCAG 

CCCTCAGCCCAGGGGCCAGCAGAGACACTTGAGATGTGGCTGATGCCCCAGAGGTTGCTGAAGCC 

GTTGGGGCACAGTGACCAGGCCAGCCGGCTCCAGAGTCTGGAGGTTGGCACTGAGTGGGACCCCA 

AAGAACGTCTCTTCCGGAACTTTGGGGGCCTGTTGGGACCACTGGATGAACCTGTGGCCATGCAG 

CGCTGGGCCCGGGGCCCCAACCTCACAGCCACTGTGGTCTGGATTGACCCCACCTATGTTGTGGC 

CACATCCTATGACATCACGGTAGATGCGGACACTGAAGTCACGCAGTACAAGCCCCCACTGAGCC 

TGCCACTGCGGCCAGGAGCCTGGACTGTTCGATTGCTTCAGTTCTGGGAGCCCCTGGGTGAGACC 

CGCTTCCTCGTGCTGCCATTGACCTTCAACCACAAACTACCTCTCAGGAAAGATGATGCCAGCTG 

GCTGCATGCGGGACCACCCCACAACGAATACATGGAACAGAGTTTCCAGGGACTAAGTGGCATCC 

TGAATCTGCCTCAGCCAGAGGCCGTGGAGGAGGCTGCCCGGCGGCACACAGAGCTCACAGGTCCT 

GCACTTGAGGCCTGGACAGATGGGGAACTGAGCAGTTTCTGGTCTGTTGCAGGATTGTGTGCCAT 

AGGGCCTTCTTCTTGTCCCTCCCTGGAGCTCTGCAGACTGACCAGCTGGAGCTCTCTGTCTCCTG 

ACCCCAAGTCAGAGCTGGGGCCTGTCAAAGCTGACGGACGACTCAGGTAGCAGGACCCCAGCCAG 

CACAACCCGGAGGAGCCGGGGAATTGCACCTTACAGACAATGGAGGGACGTCTCTCCTCTGGTAG 

CCAGGGAGCCCTAGAACAGTGGTGGGGCTGTCTGTCTGTCAGGCTTTCCTAGGAAATGGACGACA 

TCAGGCCCTGCTGAGATCCATGAAGTTTCCCCACTGGAGGAGGGAGCCCCACTGGCAGCACAG 
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SEQUENCE LISTING 

<110> Kleesiek, Knat 

<120> Xylosyltransferase and isoforms thereof 

<130> Xylosyltransferase 

<140> 
<141> 

<ieo> 36 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 3726 

<212> DNA 

<213> Homo sapiens 

<220> . 

<221> CDS 

<222> (1) . . (2484) 

<223> human XT-I (not completely) 

<220> 

<221> misc_binding 
<222> (544) . . (552) 

<223> binding motif for nucleotide-sugars 
<220> 

<221> misc_feature 
<222> (280) , . (282) 
<223> N-glycosylation site 

<220> 

<221> misc_feature 
<222> (865) . . (867) 
<223> N-glycosylation site 

<220> 

<221> misc^feature 
<222> (1933) . . (1935) 
<223> N-glycosylation site 

<400> 1 

act cag gat ggc tac ttt tct cat egg ccg aaa gag aaa gtg cga aca 48 
Thr Gin Asjp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
1 5 10 15 

gac age aac aac gag aac tct gtc ccc aaa gac ttt gag aat gtg gac 96 
Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp. Phe Glu Asn Val Asp 
20 25 30 

aac age aac ttc gca ccc agg act caa aag cag aag cac cag cct gag 144 
Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
35 40 45 

ttg gcg aag aag cca ccg agt aga cag aag gag ctt ttg aaa agg aag 192 
Leu Ala Lys Lys Pro Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
50 55 60 
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ctg gaa cag cag gag aaa gga aaa gga cat aca ttc cct ggg aaa ggc 240 
Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
65 70 75 80 

ccc ggt gag gtg ctg cct ccc ggg gac aga gcc gca gcc aac age age 288 
Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 

85 90 95 " . 

cac ggg aag gat gtg tcc_aga_ccg cct cat gcc agg aaa act ggg ggc 336 
His Gly Xys'Asp Val Ser" Arg Pro Pro His Ala Arg Lys Thr Gly Gly 
100 105 110 

age tec ccc gag ace aag tat gac cag ccc cct aag tgt gac ate tea 384 
Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp lie Ser 
115 120 125 

ggc aag gag gcc ate tct gcc ctg tec cgt get aag tec aag cac tgc. 432 ■ 
Gly Lys Glu Ala lie Ser Ala Leu Ser Arg Ala Xys Ser Lys His Cys 
130 135 140 

cgc cag gag att ggg gag act tac tgc cgc cac aag tta ggg ctg ctg 480 
Arg Gin- Glu lie Gly Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu 
145 150 155 160 

atg cct gag aag gtg act egg ttc tgc ccc etc gag ggt aaa gcc aac 528 
Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
165 170 175 

aag aac gtg cag tgg gac gag gac tec gtg gag tac atg cca gcc aac 576 
Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
180 185 190 

ceg gte aga ate gcc ttt gtc ctg gtg gtc cac ggc pgt gcc tct egg 624 
Pro Val Arg lie Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
195 200 205 

cag ttg cag cgc atg ttc aag gcc ate tac cac aaa gac cac ttc tac 672 
Gin Leu Gin Arg Met Phe Lys Ala lie Tyr His Lys Asp His Phe Tyr 
210 215 220 

tac ate cac gtg gac aag cgc tct aat tac ctg cat egg caa gtg etc 720 
Tyr lie His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
225 230 235 240 

cag gte tec agg cag tac age aat gtc cgc gtc ace ccc tgg aga atg 768 
Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
245 250 255 

gcc acc ate tgg gga gga gcc age etc ctg tec acc tac ctg cag age 816 
Ala Thr lie Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 
260 265 270 

atg egg gac etc ctg gag atg ace gac tgg ccc tgg gac ttc ttc ate 864 
Met Arg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He 
275 280 285 

aac ctg agt gcg gcc gac tac ccc ate agg aca aat gac cag ttg gtg 912 
Asn Leu Ser Ala Ala TVsp Tyr Pro He Arg Thr Asn Asp Gin Leu Val 
290 295 300 

gcg ttt etc tec cga tac cga gat atg aat ttc ttg aag tea cac ggc 960 
Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
305 310 315 320 
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egg gac aat gca agg ttc att egg aag cag ggc ctg gat egg etc ttc 1008 
Arg Asp Asn Ala Arg Phe lie Arg Lys Gin Gly Leu Asp Arg Leu Phe 

325 330 335 

ctg gag tgc gae get cac atg tgg cgc ctg gga gat egg egg ate cca 1056 
Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg lie Pro 
340 345 350 

gag ggc att gee gtg gat ggc ggt teg gae tgg ttc ctg ctg aac egg 1104. 
Glu Gly lie Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
355 360 365 

agg ttt gta gaa tat gtg ace ttc tee aca gac gat ctg gtg ace aag 1152 
Arg Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
370 375 380 

atg aaa cag ttc tac tec tae ace ctg ett cct get gag tec ttc ttc 1200 
Met Lys Gin Phe Tyr Ser Tyr Thr Ifeu Leu Pro Ala Glu Ser Phe Phe 
385 * 390 395 400 

cat acg gtc ctg gag aac age cec cac tgc gac ace atg gtg gac aac 1248 
His Thr Val Leu X31u Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
405 410 415 

aac ctg cgc ate acc aac tgg aat cgc aag ctg ggc tgc aag tgc. cag 1296 
Asn Leu Arg lie Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin 
420 425 430 

tac aag cac ate gtg gac tgg tgc ggc tgc tec cec aat gae ttc aag 1344 
Tyr Lys His He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys 
435 440 445 

eeg cag gac ttc cac cgc ttc cag cag aca gee egg cct ace ttc ttt 1392 
Pro Gin Asp Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe 
450 455 460 

gee cgc aag ttt gaa gee gtg gtg aat cag gaa ate att ggg cag ctg 1440 
Ala Arg Lys Phe Glu Ala Val Val Asn Gin Glu He He Gly Gin Leu 
465 470 475 . 480 

gac tat tac ctg tae ggg aac tae cct gca ggt ace eeg ggc ctg cgc 1488 
Asp Tyr Tyr Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg 
485 490 495 

tec tae tgg gag aat gtc tac gat gag cct gac ggc ate cac age ctg 1536 
Ser Tyr Trp Glu Asn Val Tyr Asp Glu Pro Asp Gly He His Ser Leu 
500 505 510 

age gae gtg aca etc acc ttg tac cac tec ttt gee cgc ctg ggt ett 1584 
Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Gly Leu 
515 520 525 

ega egg get gag acg tec ctg cac acg gat ggg gag aac age tgc ega 1632 
Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg 
530 535 540 

tae tac cca atg ggc cac cca gca tet gtg cac etc tac ttc ett get 1680 
Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
545 550 555 560 

gac cgc ttc cag ggc ttt ctg ate aag cat cat get acc aat ctg get 1728 
Asp Arg Phe Gin Gly Phe Leu He Lys His His Ala Thr Asn Leu Ala 
565 570 575 
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gtg age aaa eta gag act ctg gag acc tgg gtg atg ccg aaa aaa gtc 1776 

Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 

580 • • ' • . 585 ■ ■ 590 

ttc aag ate gca age cca ccc agt gac ttt ggg agg ctt cag ttt tec 1824 
Phe Lys lie Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 
595 600 605 

gag gtc ggc act gac tgg gat gee aag gag agg eta ttc cgc. aac ttt 1872 
Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
610 615 620 

ggg ggt ctt ctg ggg ccc atg gat gag ccg gtg ggt atg cag aag tgg 1920 
Gly Gly Leu Leu Gly Pro Met Aspi Glu Pro Val Gly Met Gin Lys Trp 
625 630 635 640 

ggg aag gga cct aat gtg acc gtg acc gtc att tgg gtg gat ccc gtc 1968 
Gly Lys Gly Pro Asn Val Thr Val -Phr Val He .Trp Val Asp Pro Val 
645 650 655 

aat gtc ate gca gcc acc tac gac ate etc att gag tec act gee gaa 2016 
Asn Val He Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu 
660 665 670 

ttc aca eac tac aag ccc cct ttg aac ttg ccc ctg agg cct ggg gtc 2064 
Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
675 680 685 

tgg aca gtg aaa att etc cae eac tgg gtg cca gtt gca gag acc aaa 2112 
Trp Thr Val Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys 
690 695 700 

ttc etc gtt gcg cct ctg acc ttc teg aac agg cag ccc ate aaa cct 2160 
Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro He Lys Pro 
705 710 715 720 

gag gag gca ctg aag ctg eac aat ggg ccc etc ege aat gcc tac atg 2208 
Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
725 730 735 

gag cag age ttc cag age eta aac ccc gtc etc age ctg ccc ate aac 2256 
Glu Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro He Asn 
740 745 750 

ccc gee cag gtg gaa cag gca egg agg aac gca gee tec acg ggc aca 2304 
Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
755 760 765 

gcg ctg gag gga tgg ctg gac teg ttg gtg ggc ggg atg tgg act gcc 2352 
Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
770 775 780 

atg gac ate tgt gcc acg ggc ccc aca gcc tgc ccg gtc atg cag ace 2400 
Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
785 790 795 800 

tgc age cag acg gcc tgg age tee ttc age cct gac ccc aag teg gag 2448 
Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
805 810 815 

ctg ggt gcc gtc aaa cct gat ggc egg etc agg tag cactgggcac 2494 
Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arg 
820 825 
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gaggagtggg ccacagcagg atctcaacgg gaaagcagcc agaggggttg tggggcctga 2554 

accccggcct ccaccctggg ggaggccctc tgtgaatggg tctctcctgg ccatagaatg 2614 

atggaaaggg- aaggtcagca ggtcaaagca ggatcagcca acaacctgcc tttggcaagc 2674 

tgcgggtggg atggctcagt ccctgcactg tgactgtctc acctcttctg sttgatcctc 2734 

aagtcctaca ggttccttgt cttccccttc cagtgaccca cccctgaccc cagacgtgtg 2794 

attttcagac ttttctttcg agcagcagaa cttcgtttac ggagcacagt cataagtgra 2854 

ggttcagggt gctgacgaaa tccaagctgc tctggttgaa gctgacaagt gcgaggttcc 2914 

ctcccaaagc tcagccctct gggcggtccc cttgcccagg gtatctccta cggtacctct 2974 

tcagaaccca agggctctgc aaatgccagt ttgacaagca ctgcccagac caaccatggg 3034 

ttcaaactcc agccctgccc tttggttcat ttttctgctt ctcttggctg ggggactctg 3094 

gtgccagcct tgaaagtcat ggtcgtgggc cctttcccat ggaggctgca gccttaggag 3154 

agctctgagc ctctcagcag ccctccttgg gttgaactat tctccttagt aactaggtaa 3214 

gtgggaaagc cttttgatgt ggcatggcca aggtccagcc acaagtgcaa ctgccacctg 3274 

tccaggggtc tgggcctcct tccctcaagg ctgccacaca aagtagcaga aataggatga 3334 

tgtttgtgag caccagactc aagaccatga ccttctttga tccttgaaaa tgggaacttt 3394 

gacagccatg accatgaaac tcacaaggca acgcggatga aactcacaaa gcaatgcttg 3454 

gagcaaaact cctgagctag acagcacagc agcacccatc ccctgccaga gcccttccgt'"35r4 

ttgaggtcag acacacaaaa ccttcgtcaa ^ttgcacaccg -gtgctgttgg gagtgaccaa 3574 

accacatgaa ccagactttt cccgtccagg aaatagcatt tcagatttgg tttttaattt 3634 

catgcccttc ggccacaggc tcaacgggac atgcaacata aaaatgggaa ggttattcaa 3694 

cccgcgggaa aaaaaaaaaa aaaaaaaaaa aa 3726 



<210> 2 

<211> 827 ...... 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Thr Gin Asp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
15 10 15 

Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp 
20 25 30 

Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
35 40 45 

Leu Ala Lys Lys Pro. Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
50 55 60 

Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
65 70 75 - - 80 

Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 
85 . 90 95 

His Gly Lys Asp Val Ser Arg Pro Pro His Ala Arg Lys Thr .Gly Gly 
100 105 110 

Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp He Ser 
115 120 125 

Gly Lys Glu Ala He Ser Ala Leu Ser Arg Ala Lys Ser Lys His Cys 
130 135 140 

Arg Gin Glu He Gly Glu Thr Tyr Cys 7\rg His Lys Leu Gly Leu Leu 
145 150 155 160 

Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
165 170 175 
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Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
180 185 190 

Pro Val Arg He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
. 195 200 205 

Gin Leu Gin Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr 
210 215 220 

Tyr He His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
225 . 230 235 240 

Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
245 250 255 

Ala Thr He Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 

260 ■ 270 

Met Arg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He 
275 280 285 

Asn Leu Ser Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val 
290 295 300 

Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
305 310 315 320 

Arg Asp Asn Ala Arg Phe He Arg Lys Gin Gly Leu Asp Arg Leu Phe 

325 330 335 

Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg He Pro 
340 345 350 

Glu Gly He Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
355 360 365 

Arg Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
370 375 380 

Met Lys Gin Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe 
385 390 395 400 

His Thr Val Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
405 410 415 

Asn Leu Arg He Thr Asn Trp .Asn Arg Lys Leu Gly Cys Lys Cys Gin 
420 425 . • 430 



Tyr Lys His He Val Asp Trp Cys 
435 440 

Pro Gin Asp Phe His Arg Phe Gin 
450 455 

Ala Arg Lys Phe Glu Ala Val Val 
465 470 

Asp Tyr Tyr Leu Tyr Gly Asn Tyr 
485 

Ser Tyr Trp Glu Asn Val Tyr Asp 
500 



Gly Cys Ser Pro Asn Asp Phe Lys 
445 

Gin Thr Ala Arg Pro Thr Phe Phe 
460 

Asn Gin Glu He He Gly Gin Leu 
475 480 

Pro Ala Gly Thr Pro Gly Leu Arg 
490 495 

Glu Pro Asp Gly He His Ser Leu 
505 510 
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Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Giy Leu 
515 520 525 

Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser^ Cys Argv-. - <■ 
530 535 540 

Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
545 550 555 560 

Asp Arg Phe Gin Gly Phe Leu lie Lys His His Ala Thr Asn Leu Ala 
565 570 575 

Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 

580 585 590 



Phe Lys lie Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 

595 600 605 

Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
610 615 620 

Gly Gly Leu Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp 
625 630 635 640 

Gly Lys Gly Pro Asn Val Thr Val Thr Val lie Trp Val Asp Pro Val 
645 650 655 

Asn Val lie Ala Ala Thr Tyr Asp lie Leu He Glu Ser Thr Ala Glu 

660 665 670 

Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
675 680 685 

Trp Thr Val Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys 
690 695 700 

Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro He Lys Pro 
705 710 715 720 

Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
725 730 735 

Glu. Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro He Asn 
740 745 750 

Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
755 760 765 

Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
770 775 780 

Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
785 790 795 800 

Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
805 810 815 

Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arq 
820 825 
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<210> 3 

<211> 3608 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (150).. (2744) 
<223> human XT-II 

<220> 

<221> misc_feature — 

<222> (513) . . (515) 

<223> N-glycosylation site 

<220> 

<221> misc_feature 
<222> (1128) . . (1130) 
<223> N-glcosylation site 

<220> 

<221> misc_feature 
<222> (2196) . . (2198) 
<223> N-glycosylation site 

<400> 3 

cgcaggcccc gcccccggcc ccgccccggc gcgcgcccgg cccgcctccc taggcgtgga 60 
ggaggggggg cggctcagcc ccgcgcccgt gcgcgcgctc gcggccgggt tgcagggctg 120 
ggcgcgcgcc ccgcgtcccg ggcaggaag atg gtg gcg age gcg cga gtg cag 173 

Met Val Ala Ser JU.a Arg Val Gin 
1 5 

aag ctg gtg egg cgc tac aag ctg gcg att gcc acg gcg ctg gcc ate 221 
Lys Leu Val Arg Arg Tyr Lys Leu Ala lie Ala Thr Ala Leu Ala He 
10 15 20 

ctg ctg ctg cag ggc ctg gta gtg tgg age ttc age ggc ctg gag gag 269 
Leu Leu Leu Gin Gly Leu Val Val Trp Ser Phe Ser Gly Leu Glu Glu 
25 30 35 40 

gac gag gcg ggc gag aaa gga agg cag agg aag cca egg cca ctg gae 317 
Asp Glu Ala Gly Glu Lys Gly Arg Gin Arg Lys Pro Arg Pro Leu Asp 
45 50 55 



cet ggc gaa ggt tec aag gac aca gac agt tea gea ggg cga egg ggc 365 
Pro Gly Glu Gly Ser Lys Asp Thr Asp Ser Ser Ala Gly Arg Arg Gly 
60 65 70 

age aca ggc aga agg cat ggg cgc tgg egg ggc cgt get gag age cca 413 
Ser Thr Gly Arg Arg His Gly Arg Trp Arg Gly Arg Ala Glu Ser Pro 
75 80 85 

gga gtg cec gtg gcc aag gtg gta egg gea gta acc age egg cag aga 461 
Gly Val Pro Val Ala Lys Val Val Arg Ala Val Thr Ser Arg Gin Arg 
90 95 100 

gcc age egg egg gtc cca cet gee cca cet ccg gaa gee cca ggc cgc 509 
Ala Ser Arg Arg Val Pro Pro Ala Pro Pro Pro Glu Ala Pro Gly Arg 
105 110 115 120 

cag aac ctg agt ggg gea gea get ggg gag gcg ctg gta ggg gea get 557 
Gin Asn Leu Ser Gly Ala Ala Ala Gly Glu Ala Leu Val Gly Ala Ala 
125 130 135 
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ggc ttc cca cca cac gga gat aca 
Gly Phe Pro Pro His Gly Asp Thr 
140 
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ggg age gtg gag ggc gcc ccc cag 605 
Gly Ser Val Glu Gly Ala Pro Gin 
145 150 



ccc acg gac aat ggc ttc 

Pro Thr TVsp Asn Gly Phe 
155 

gca ctg tct gca ctg gcc 

Ala Leu Ser Ala Leu Ala 
170 

ate gcc aat gtg gtg tgc 

He Ala Asn Val Val Cys 

185 190 



acc ccc aag tgc gag ate 
Thr Pro Lys Cys Glu He 
160 

egg gcc age ace aag eag 
Arg Ala Ser Thr Lys Gin 
175 180 

ctg cac cag get ggg age 
Leu His Gin Ala Gly Ser 
195 



gtg ggc aag gac 653 
Val Gly Lys Asp 
165 

tgc cag cag gag 701 
Cys Gin Gin Glu 



etc atg ccc aag 749 
Leu Met Pro Lys 
200 



get gtg ccc egg cac tgt cag ctg act ggg aag atg age ccc ggc ate 797 

Ala Val Pro Arg His Cys Gin Leu ^hr Gly Lys -Met Ser Pro Gly He 
205 210 215 

eag tgg gat gag age eaa gee cag cag ccc atg gat ggc ccc ceg gtg 845 

Gin Trp Asp Glu Ser Gin Ala Gin Gin Pro Met Asp Gly Pro Pro Val 
220 225 230 



cga ate gcc tac atg ctg gtg gtt cac ggc cgc gcc ate cgc cag ctg 893 
Arg He Ala Tyr Met Leu Val Val His Gly Arg Ala He Arg Gin Leu 
235 240 245 

aag cgt etc etc aag gee gtt tat cac gag cag cac ttc ttt tac ate 941 
Lys Arg Leu Leu Lys Ala Val Tyr His Glu Gin His Phe Phe Tyr He 
250 255 260 

cat gtg gac aag cgt tec gac tac ctg cac egg gag gtg gtg gag ctg 989 
His Val Asp Lys Arg Ser Asp Tyr Leu His Arg Glu Val Val Glu Leu 
265 270 275 280 

gcc cag ggc tat gat aac gtg egg gtg acg ccc tgg cgc atg gtt acc 1037 
Ala Gin Gly Tyr Asp Asn Val Arg Val Thr Pro Trp Arg Met Val Thr 
285 290 . 295 

ate tgg ggc ggg gcc age etc ctg acg atg tac ctg egg age atg egg 1085 
He Trp Gly Gly Ala Ser Leu Leu Thr Met Tyr Leu Arg Ser Met Arg 
300 305 310 

gac ctg eta gag gtg cet ggc tgg gee tgg gac ttc ttc ate aac etc , 1133 
Asp Leu Leu Glu Val Pro Gly Trp Ala Trp Asp Phe Phe He Asn Leu 
315 320 325 

agt gee act gac tat cca acc agg acc aat gag gag ctg gtg gca ttc 1181 
Ser Ala Thr Asp Tyr Pro Thr Arg Thr Asn Glu Glu Leu Val Ala Phe 
330 335 340 

eta tec aag aac egg gac aag aat ttc etc aag tea cat ggc egg gac 1229 
Leu Ser Lys Asn Arg Asp Lys Asn Phe Leu Lys Ser His Gly. Arg Asp 

345 350 355 360 

aac tec agg ttc ate aag aaa eag ggc ctg gac egg etc ttc cat gag 1277 
Asn Ser Arg Phe He Lys Lys Gin Gly Leu Asp Arg Leu Phe His Glu 
365 370 375 

tgc gac tea cac atg tgg cgc ctg ggc gag egg cag ate cca gca ggc 1325 
Cys Asp Ser His Met Trp Arg Leu Gly Glu Arg Gin He Pro Ala Gly 
380 385 390 



wo 01/49831 



PCT/EPOO/13311 



10 / 32 

att gtg gtg gat ggc ggt tct gac tgg ttc gtg ctg aca cgc age ttt 1373 
lie Val Val Asp Gly Gly Ser Asp Trp Phe Val Leu Thr Arg Ser Phe 
395 400 405 

gtg gag tat gtg gtg tac aca gat gac ccg ctt gtg gcc cag ctg cgc 1421 
Val Glu Tyr Val Val Tyr Thr Asp Asp Pro Leu Val Ala Gin Leu Arg 
410 415 420 

cag ttc tac aca tac aca ctg etc cca gcc gag tec ttc ttc cac acg 14 69 
Gin Phe Tyr Thr Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr 
425 430 435 440 

gtg ctg gag aac age ctg gcc tgt gag acc etc gtg gac aac aac ctg 1517 
Val Leu Glu Asn Ser Leu Ala Cys Glu Thr Leu Val Asp Asn Asn Leu 

445 450 455 



egg gtc acc aac tgg aac cga aag ctg ggc tgc aag tge cag tac aag 1565 
Arg Val Thr Asn Trp Asn Arg Lys lieu Gly Cys -Lys Cys Gin Tyr Lys 
460 465 470 

cac att gtg gac tgg tgt ggc tgc tec ccc aac gac ttc aag cca cag 1613 
His lie Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin 
475 480 485 

gac ttc etc. egg ctg cag caa gtc tec aga ccc acc ttc ttc gee egg 1661 
Asp Phe Leu Arg Leu Gin Gin Val Ser Arg Pro Thr Phe Phe Ala Arg 
490 495 500 

aag ttc gag teg act gtg aac cag gag gtg ctg gaa ate ctg gac ttc 1709 
Lys Phe Glu Ser Thr Val Asn Gin Glu Val Leu Glu lie Leu Asp Phe " 
505 510 515 520 

cac ctg tac ggc age tac ccc ccc ggc acg cca gcc etc aag gcc tac 1757 
His Leu Tyr Gly Ser Tyr Pro Pro Gly Thr Pro Ala Leu Lys Ala Tyr 
525 530 535 

tgg gag aac ace tac gac gcg get gat ggc ccc agt ggg etc agt gat 1805 
Trp Glu Asn Thr Tyr Asp Ala Ala Asp Gly Pro Ser Gly Leu Ser Asp 
540 545 550 

gtc atg etc act get tac aca gcc ttc gee cgc etc age ctg cac cat 1853 
Val Met Leu Thr Ala Tyr Thr Ala Phe Ala Arg Leu Ser Leu His His 
555 560 565 

gcc gcc act get gca ccc cca atg ggc acc cca etc tgc agg ttt gag 1901 
Ala Ala Thr Ala Ala Pro Pro Met Gly Thr Pro Leu Cys Arg Phe Glu 
570 575 580 

ccc agg ggc ttg ccg tec age gtg cac ctg tat ttc tat gac gac cat 1949 
Pro Arg Gly Leu Pro Ser Ser Val His Leu Tyr Phe Tyr Asp Asp His 
585 590 595 600 

ttc cag ggc tac ctg gtg acg cag gcg gtg cag ccc tea gcc cag ggg 1997 
Phe Gin Gly Tyr Leu Val Thr Gin Ala Val Gin Pro Ser Ala Gin Gly 
605 610 615 

ccg gca gag acg ctt gag atg tgg ctg atg ccc caa ggg teg ctg aag 2045 
Pro Ala Glu Thr Leu Glu Met Trp Leu Met Pro Gin Gly Ser Leu Lys 
620 625 630 

ctg ttg ggg cgc agt gac cag gcc age egg etc cag agt ctg gag gtt 2093 
Leu Leu Gly Arg Ser Asp Gin Ala Ser Arg Leu Gin Ser Leu Glu Val 
635 640 645 
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ggc act gat tgg gac ccc aaa gag cgt ctt ttc egg aac ttt ggg ggg 2141 
Gly Thr Asp Trp Asp Pro Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly 
650 655 660 

tta ctg ggg ccg ctg gac gag cct gtg gcc gtg cag cgc. tgg gcc egg 2189 
Leu Leu Gly Pro Leu Asp Glu Pro Val Ala Val Gin Arg Trp Ala Arg 
665 670 675 680 

ggc ccc aac etc aca gcc aca gtg gtc tgg ate gac cca acc tat gtg 2237 
Gly Pro Asn Leu Thr Ala Thr Val Val Trp lie Asp Pro Thr Tyr Val 
685 690 695 . 

gtg gcc aca tct tat gac ate aca gta gat aeg gag act gag gtc acg 2285 
Val Ala Thr Ser Tyr Asp lie Thr Val Asp Thr Glu Thr Glu Val Thr 

700 705 710 

caa tac aag ccc cca ctg age egg ccc ctg egg cca ggg ccc tgg act 2333 
Gin Tyr Lys Pro Pro Leu Ser Arg Pro Leu Arg Pro Gly Pro Ti^d Thr 
715. 720 725 

gtt ega etc ctt cag ttc tgg gaa ccg ctg ggt gag ace cgc ttc ctt 2381 
Val Arg Leu Leu Gin Phe Trp Glu Pro Leu Gly Glu Thr Arg Phe Leu 
730 735 740 

gtg ctg ccc ttg ace ttc aac cgc aaa eta cct etc agg aaa gat gat 2429 
Val Leu Pro Leu Thr Phe Asn Arg Lys Leu Pro Leu Arg Lys Asp Asp 
745 750 755 760 

gee age tgg ctg eac gea ggg cca ccc cac aac gag tac atg gag cag 2477 
Ala Ser Trp Leu His Ala Gly Pro Pro His Asn Glu Tyr Met Glu Gin 
765 770 775 

agt ttc cag ggc ctg agt age ate ctg aac ctg cct cag ccg gag etc 2525 
Ser Phe Gin Gly Leu Ser Ser lie Leu Asn Leu Pro Gin Pro Glu Leu 
780 785 790 

gcg gag gag get gcc cag egg cac aca cag etc aca ggc cct gcg etc 2573 
Ala Glu Glu Ala Ala Gin Arg His Thr Gin Leu Thr Gly Pro Ala Leu 
795 800 805 

gag gee tgg aca gac agg gaa ctg age age ttc tgg tee gtg get gga 2621 
Glu Ala Trp Thr Asp Arg Glu Leu Ser Ser Phe Trp Ser Val Ala Gly 
810 815 820 

ctg tgt gee ata ggc ccc tct ccc tgc ccc tee ctg gag ccc tgc aga 2669 
Leu Cys Ala lie Gly Pro Ser Pro Cys Pro Ser Leu Glu Pro Cys Arg 
825 830 835 840 

ctg ace age tgg age tct ctg tec ccc gac ccc aaa tea gag ctg ggg 2717 
Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp Pro Lys Ser Glu Leu Gly 
845 850 855 

cct gtc aaa gea gac ggg ega etc agg tagcagggcc ccagccagta 2764 
Pro Val Lys Ala Asp Gly Arg Leu Arg 

860 865 

ecegtggagg acccgggaaa ttgeacetta cagaeagtgg aggggtgtcc ectcecacag 2824 
geaagaacca gaggcecagg ctgeacacce atttcagcea tcaagaaece acacagacgg 2884 
cagggaaggt ggaeaeagta tgaactactg etgatgtetc tgttggggat eagagggctg 2944 
gcgggaacgc gagaagggea ceagcageat teeacaccca getetteete aecttcetgt 3004 
ctagtttgaa tttctttttt ttctttcttt tttttttttt tttaatttaa aaaggaaaat 3064 
gggtggttgg gagagaaact agaacagaag atgtgeaata gggcteagag eageeceagg 3124 
aagtgggcea tgtctgtggg gageagtgge ttgetgagee cacctgetat tgttctgcae 3184 
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gaggctgggc ctgcctcact ctccagggcc tcatgcccca ttctgggcct gtggtgctcg 3244 
tggctgaggc tccacagggc tgcaagtgcc ctgccaggct ctaaggcccg aagaacaggt 3304 
gatatcgggg gcgctgcaga gctgggctct agcaggacag ttcttttgta gccaggggac 3364 
cctagaagtg gggtggggcg gctccagggc tttccagcat accctgcccc tggatgggaa 3424 
aggcagggtc agggcccact gagacccatg cagagtttcc aggtgtgcac tggaagtgag 3484 
gtcacatgag cagcgtggga agaagactct gtcaagactc tcagaagaac ctggagtaat 3544 
tgtgcctgaa gctcagcgtg aagtctgaaa tatgcaacag aagaaatata tctctatctc 3604 
tcta 3608 



<210> 4 
<211> 865 
<212> PRT 

<213> Homo sapiens 

<400> 4 . . . . . - 

Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu 
1 5 - 10 - 15 

Ala lie Ala Thr Ala Leu Ala lie Leu Leu Leu Gin Gly Leu Val Val 
20 25 30 

Trp Ser Phe Ser Gly Leu Glu Glu Asp Glu Ala Gly Glu Lys Gly Arg 
35 40 45 

Gin Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr 
50 55 .60 

Asp Ser Ser Ala Gly Arg Arg Gly Ser Thr Gly Arg Arg His Gly Arg 
65 70 75 80 

Trp Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val 
85 90 95 

Arg Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro: Pro Ala 
100 105 110 

Pro Pro Pro Glu Ala Pro Gly Arg Gin Asn Leu Ser Gly Ala Ala Ala 
.115 120 125 

Gly Glu Ala Leu Val Gly Ala Ala Gly Phe Pro Pro His Gly Asp Thr 
130 135 140 

• Gly Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Gly Phe Thr Pro 
145 150 155 160 

Lys Cys Glu lie Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala 

165 170 175 

Ser Thr Lys Gin Cys Gin Gin Glu lie Ala Asn Val Val Cys Leu His 
180 185 190 

Gin Ala Gly Ser Leu Met Pro Lys Ala Val Pro Arg His Cys Gin Leu 
195 200 . 205 

Thr Gly Lys Met Ser Pro Gly lie Gin Trp Asp Glu Ser Gin Ala Gin 
210 215 220 

Gin Pro Met Asp Gly Pro Pro Val Arg He Ala Tyr Met Leu Val Val 
225 230 235 240 

His Gly Arg Ala lie Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr 
245 250 255 
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His Glu Gin His Phe Phe Tyr lie His Val Asp Lys Arg Ser Asp Tyr 

260 265 270 

Leu His Arg Glu Val Val Glu Leu Ala Gin Gly Tyr Asp Asn Val Arg 
275 280 285 



Val Thr Pro Trp Arg Met Val Thr He Trp Gly Gly Ala Ser Leu Leu 
290 295 300 

Thr Met Tyr Leu Arg Ser Met Arg Asp Leu Leu Glu Val Pro Gly Trp 
305 310 315 320 

Ala Trp Asp Phe Phe lie Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg 
325 330 335 . 

Thr Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn 
340 345 350 

Phe Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe He Lys Lys Gin 
355 360 365 

Gly Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu 
370 375 380 

Gly Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly. Gly Ser Asp 
385 390 395 400 

Trp Phe Val Leu Thr Arg Ser Phe Val Glu Tyr Val Val Tyr Thr Asp 

405 410 415 

Asp Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu 
420 425 430 



Pro Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Leu Ala Cys 
435 440 445 

Glu Thr Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys 
450 455 460 

Leu Gly Cys Lys Cys Gin Tyr Lys His He Val Asp Trp Cys Gly Cys 
465 470 475 480 



Ser Pro Asn Asp Phe Lys Pro Gin 
485 

Ser Arg Pro Thr Phe Phe Ala Arg 
500 

Glu Val Leu Glu He Leu Asp Phe 
515 520 

Gly Thr Pro Ala Leu Lys Ala Tyr 
530 535 

Asp Gly Pro Ser Gly Leu Ser Asp 
545 550 

Phe Ala Arg Leu Ser Leu His His 
565 

Gly Thr Pro Leu Cys Arg Phe Glu 
580 



Asp Phe Leu Arg Leu Gin Gin Val 
490 495 

Lys Phe Glu Ser Thr Val Asn Gin 
505 510 

His Leu Tyr Gly Ser Tyr Pro Pro 
525 

Trp Glu Asn Thr Tyr Asp Ala Ala 
540 

Val Met Leu Thr Ala Tyr Thr Ala 
555 560 

Ala Ala Thr Ala Ala Pro Pro Met 
570 " 575 

Pro Arg Gly Leu Pro Ser Ser Val 
585 590 
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His Leu Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin 
595 600 605 

Ala . Val Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp 
610 615 620 

Leu Met Pro Gin Gly Ser Leu Lys Leu Leu Gly Arg Ser Asp Gin Ala 
625 630 635 640 

Ser Arg Leu Gin Ser Leu Glu Val Gly Thr Asp Trp Asp Pro Lys Glu 
645 650 655 

Arg Leu Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro 
660 665 670 

Val Ala Val Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val 
675 680 685 

Val Trp He Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr 
690 695 700 

Val Asp Thr Glu Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Arg 
705 710 715 720 

Pro Leu Arg Pro Gly Pro Trp Thr Val Arg Leu Leu Gin Phe Trp Glu 
725 730 735 

Pro Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn Arg 

740 745 750 

Lys Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro 
755 760 765 

Pro His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Ser He 
770 775 780 

Leu Asn Leu Pro Gin Pro Glu Leu Ala Glu Glu Ala Ala Gin Arg His 
785 790 795 800 

Thr Gin Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Arg Glu Leu 
805 810 815 



Ser Ser Phe Trp Ser Val Ala Gly Leu Cys Ala He Gly Pro Ser Pro 
820 825 830 

Cys Pro Ser Leu Glu Pro Cys. Arg Leu Thr Ser Trp Ser Ser Leu Ser 
835 840 845 

Pro Asp Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu 
850 855 860 



Arg 
865 



<210> 5 
<211> 2463 
<212> DNA 

<213> Rat coronavirus 

<220> 

<221> CDS 

<222> (1) . . (2463) 

<223> rat XT-I 

<400> 5 
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tac ttc tct cac egg ccc aaa gag aaa gtt egg aee gac age aae aat 48 

Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr Asp Ser Asn Asn 
15 10 15 

gag aae tea gtc ccc aag gac ttt gag aat gtg gac aac age aac ttc 96 
Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp Asn Ser Asn Phe 
20 25 30 

gca ccc agg act caa aag cag aag cac cag cca gag ttg gca aag aag 144 
Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu Leu Ala Lys Lys 
35 40 45 

cce etc age agg cag aaa gag cgt ttg eag aga aag ctg ggc gee cag 192 
Pro Leu Ser Arg Gin Lys Glu Arg Leu Gin Arg Lys Leu Gly Ala Gin 
50 55 ■ '60 

gac aaa gga cag ggg cag tea gtc eta gga aaa ggc ccc aag gag gtg 240 
Asp Lys Gly Gin Gly Gin Ser Val Beu Gly Lys Gly Pro Lys Glu Val 
65 ' 70 75 80 

ctg ect cct egg gaa aaa gee cca ggc aac agt age caa ggg aag gat 288 
Leu Pro Pro Arg Glu Lys Ala Pro Gly Asn Ser Ser Gin Gly Lys Asp 
85 - 90 95 

etc tea aga cac age cat age agg aag agt ggt ggc ggt ggg tec ceg 336 
Leu Ser Arg His Ser His Ser Arg Lys Ser Gly Gly Gly Gly Ser Pro 
100 105 110 

gaa act aag tct gac cag gtc ccc aag tgt gac ate tct ggc aag gag 384 
Glu Thr Lys Ser Asp Gin Val Pro Lys Cys Asp lie Ser Gly Lys Glu 
115 120 125 

gcc ate tea geg ctg ace egc get aag tec aag eae tgt cge eag gag 432 
Ala lie Ser Ala Leu Thr Arg Ala Lys Ser Lys His Cys Arg Gin Glu 
130 135 140 

att gca gaa.acc tac tgt cgc cac aag ctg ggg ctg ctg atg cca gag 480 
lie Ala Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu Met Pro Glu 
145 150 155 160 

aag gtg get cga ttc tgt ccc ctg gaa ggc aaa gcc aac aag aat gtc 528 
Lys Val Ala Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn Lys Asn Val 
165 170 175 

eag tgg gat gag gat get gtt gag tac atg ccc ccc aac cca gtc aga 576 
Gin Trp Asp Glu Asp Ala Val Glu Tyr Met Pro Pro Asn Pro Val Arg 
180 185 190 

ate gcc ttt gtc ctg gtg gtc cat ggc cgt gcc tct cga cag ctg cag 624 
lie Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg Gin Leu Gin 
195 200 205 

cgc atg ttt aag gcc ate tac cac aaa gac eat ttc tac tat ate cat 672 
Arg Met Phe Lys Ala lie Tyr His Lys Asp His Phe Tyr Tyr lie His 
210 215 220 

gtg gat aag cgt tec aat tac ctg cat egg caa gtg etc cag ttc tec 720 
Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu Gin Phe Ser 
225 230 235 240 



agg cag tac gac aac gtc cga gtc act tec tgg agg atg gcc acc att 
Arg Gin Tyr Asp Asn Val Arg Val Thr Ser Trp Arg Met Ala Thr He 
245 250 255 



768 
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tgg ggc gga gcc age etc ctg tec act tac etg cag ago atg egg gat 816 

Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser Met Arg Asp 

260 265 ■ ' 270 

eta ctg gag atg act gac tgg cec tgg gac ttc ttc ate aac ctg agt 864 
Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe lie Asn Leu Ser 
275 280 285 



gcg^.gct gac tac ccc ate agg aca aat gac cag etc gta gea ttt ctt 912 
Ala Ala Asp Tyr Pro lie Arg Thr Asn Asp Gin Leu Val Ala Phe Leu 
290 295 300 

tec aga tac ega gat atg aat ttc ctg aag tea eat ggt egg gac aat 960- 
Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly Arg Asp Asn 
305 310 315 320 



gea agg ttc ate egg aag cag gac ctg gac cgt etc ttc ctg gag tgt 1008 
Ala Arg Phe lie Arg Lys Gin Asp Beu Asp Arg -Leu Phe Leu Glu Cys 
325 . 330 335 

gat aca cac atg tgg egc ctg ggg gac egg ega ate eca gag ggc att 1056 
Asp Thr His Met Trp Arg Leu Gly Asp Arg Arg lie Pro Glu Gly lie 
340 345 350 

get gta gat ggc ggc tct gat tgg ttc etg eta aac egg aag ttt gta 1104 
Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg Lys Phe Val 
355 360 365 



gag tat gtg gea ttc tec aca gat gac ctg gtg ace aag atg aag cag 1152 
Glu Tyr Val Ala Phe Ser Thr Asp Asp Leu Val Thr Lys Met Lys ..Gin 
370 375 380 

ttc tac tct tat ace etc etc ect get gag tee ttt ttc cac acg gte 1200 
Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val 
385 390 395 400 

ctg gag aac age ccc cac tgt gac acc atg gtg gat aac aac ctg cgc 1248 
Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn Asn Leu Arg 
.405 410 415 

ate ace aat tgg aac cgc aag ctg ggc tgc aag tgc cag tac aag eat 1296 
lie Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His 
420 425 430 



att gtg gac tgg tgt ggc tgc tct ccc aat gac ttc aag ect cag gac 1344 
lie Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp 
435 440 -445 

ttc cat cgc ttc cag cag acg gee egg ccc acc ttc ttt gcc ega aag 1392 
Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe Ala Arg Lys 
450 455 460 

ttc gaa gcc ata gtg aac cag gaa ate att ggg cag ttg gac tct tac 1440 
Phe Glu Ala lie Val Asn Gin Glu lie He Gly Gin Leu Asp Ser Tyr 
465 470 475 480 

ctg tat ggc aac tat ect geg ggc acc ccg ggc etc cgc tct tac tgg 1488 
Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg Ser Tyr Trp 
485 490 495 

gag aat gte tat gac gaa cca gat ggc ate cag age etc age gat gtg 1536 
Glu Asn Val Tyr Asp Glu Pro Asp Gly He Gin Ser Leu Ser Asp Val 
500 505 510 
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gca etc acc atg tat cat tec ttc ate cgc ctg ggt ctt cga agg get 1584 
Ala Leu Thr Met Tyr His Ser Phe lie Arg Leu Gly Leu Arg Arg Ala 
515 ■ 520 525 

gag tea teg eta cae aeg gat ggg gag aac age tge agg tac tat cca 1632 
Glu Ser Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg Tyr Tyr Pro 
530 535 540 

atg ggc cac cca gcg tet gtc cac etc tac ttc ctg get gac cga ttc. 1680 
Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala Asp Arg Phe 
545 550 555 560 

cag ggc ttt ctg ate aag cac cat gtg acc aac ctt get gtg age aaa 1728 
Gin Gly Phe LeU lie Lys His His Val Thr Asn Leu Ala Val Ser Lys 
565 570 575 

ctg gag aca ctg gag. aca tgg atg atg cca aag aaa gtc ttc aag gtc 1776 
Leu Glu Thr Leu Glu Thr Trp Met Met Pro Lys -Lys Val Phe Lys Val 
580 585 590 

gca agt ccc ccc agt gac ttt ggg agg ctt cag ttt tet gag gtt ggc 1824 
Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser Glu Val Gly 
595 600 605 

act gac tgg gat gee aag gag aga eta ttc egg aac ttt ggt ggt ctt 1872 
Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly Leu 
610 615 620 

ttg ggg ccc atg gat gag ccg gtg gga atg cag aaa tgg ggg aag gga 1920 
Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp Gly Lys Gly 
625 630 635 640 

ccc aat gtg acc gtg act gtt att tgg gtg gat cct gtc aat gtc ate 1968 
Pro Asn Val Thr Val Thr Val lie Trp Val Asp Pro Val Asn Val lie 
645 650 655 

gca gee acc tat gat ate ctg ate gag tec act gcg gaa ttc aca cae 2016 
Ala Ala Thr Tyr Asp lie Leu lie Glu Ser Thr Ala Glu Phe Thr His 
660 665 670 

tac aaa ccc cct ttg aat ctg cct ctg agg cct ggt gtc tgg aca gtg 2064 
Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val Trp Thr Val 
675 680 685 

aag att etc cat cae tgg gta cca gtt gca gag ace aaa ttc ctt gtc 2112 
Lys lie Leu His His Trp Val Pro Val Ala Glu Thr Lys Phe Leu Val 
690 695 700 

gca cct ctg acc ttc tec aac aag cag ccc ate aaa cca gag gag get 2160 
Ala Pro Leu Thr Phe Ser Asn Lys Gin Pro lie Lys Pro Glu Glu Ala 
705 710 715 720 

ttg aag ctg cac aat ggg cca cct cgc agt gee tac atg gag cag agt 2208 
Leu Lys Leu His Asn Gly Pro Pro Arg Ser Ala Tyr Met Glu Gin Ser 
725 730 735 

ttc cag age eta aac cca gtc etc age ctg cac ate aat cct gee caa 2256 
Phe Gin Ser Leu Asn Pro Val Leu Ser Leu His lie Asn Pro Ala Gin 
740 745 750 

gtg gag cag gcc egg aag aac gca gee ttc act ggg aca gca eta gaa 2304 
Val Glu Gin Ala Arg Lys Asn Ala Ala Phe Thr Gly Thr Ala Leu Glu 
755 760 765 
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gcc tgg ctg gtg ggt ggg act tgg act gcc atg gac gtc tgt gee aca 2352 
Ala Trp Leu .Val Gly Gly Thr Trp Thr Ala Met Asp Val Cys Ala Thr 
770 775 780 

ggc GCC act gcc tgc cca gtc atg caa acc tgc age caa aca gcc tgg 2400 

Gly Pro Thr Ala Cys Pro Val Met Gin Thr Cys Ser Gin Thr Ala Trp 

785 790 795 . 800 

agt tec ttc age cot gat ccc aag tea gag etg ggt gca gtg aag cct 2448 
Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu Leu Gly Ala Val Lys Pro 
805 810 815 

gat ggt egg etc agg 2463 
Asp Gly Arg Leu Arg 
820 



<210> 6 
<211> 821 
<212> PRT 

<213> Rat corona virus 
<4G0> 6 

Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr Asp Ser Asn Asn 
1 "5 . 10 15 

Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp Asn Ser Asn Phe 

20 25 30 

Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu Leu Ala Lys Lys 
35 40 45 

Pro Leu Ser Arg Gin Lys Glu Arg Leu Gin Arg Lys Leu Gly Ala Gin 
50 55 60 

Asp Lys Gly Gin Gly Gin Ser Val Leu Gly Lys Gly Pro Lys Glu Val 
65 70 75 80 

Leu Pro Pro Arg Glu Lys Ala Pro Gly Asn Ser Ser Gin Gly Lys Asp 
85 90 95 

Leu Ser Arg His Ser His Ser Arg Lys Ser Gly Gly Gly Gly Ser Pro 
100 105 110 

Glu Thr Lys Ser Asp Gin Val Pro Lys Cys Asp He Ser Gly Lys Glu 
115 120 .125 

Ala He Ser Ala Leu Thr Arg Ala Lys Ser Lys His Cys Arg Gin Glu 
130 135 140 

He Ala Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu Met Pro Glu 
145 150 155 160 

Lys Val Ala Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn Lys Asn Val 
165 170 175 

Gin Trp Asp Glu Asp Ala Val Glu Tyr Met Pro Pro Asn Pro Val Arg 
180 185 190 

He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg Gin Leu Gin 
195 200 205 
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Arg Met Phe Lys Ala lie Tyr His Lys Asp His Phe Tyr Tyr lie His 
210 215 220 

Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val -Leu Gin Phe Ser 
225 230 235 240 

Arg Gin Tyr Asp Asn Val Arg Val Thr Ser Trp Arg Met Ala Thr lie 
•245 250 255 

Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser Met Arg Asp 
260 265 270 

Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe lie Asn Leu Ser 
275 280 285 

Ala Ala Asp Tyr Pro lie Arg Thr Asn Asp Gin Leu Val Ala Phe Leu 
290 295 300 

Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly Arg Asp Asn 
305 310 315 320 

Ala Arg Phe lie Arg Lys Gin Asp Leu Asp Arg Leu Phe Leu Glu Cys 
325 330 335 

Asp Thr His Met Trp Arg Leu Gly Asp Arg Arg lie Pro Glu Gly lie 
. -340 345 350 

Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg Lys Phe Val 
355 360 365 

Glu Tyr Val Ala Phe Ser Thr Asp Asp Leu Val Thr Lys Met Lys Gin 
370 • 375 380 

Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val 
385 390 395 400 

Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn Asn Leu Arg 
405 410 415 

lie Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His 
420 425 430 

lie Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp 
435 440 445 

Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe Ala Arg Lys 
450 455 460 

Phe Glu Ala He Val Asn Gin Glu He He Gly Gin Leu Asp Ser Tyr 
465 470 475 480 

Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg Ser Tyr Trp 
485 490 495 

Glu Asn Val Tyr Asp Glu Pro Asp Gly He Gin Ser Leu Ser Asp Val 
500 505 510 

Ala Leu Thr Met Tyr His Ser Phe He Arg Leu Gly Leu Arg Arg Ala 
515 520 525 



Glu Ser Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg Tyr Tyr Pro 
530 535 540 
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Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala Asp Arg Phe 
545 550 . 555 560 

Gin Gly Phe Leu lie Lys His His Val Thr Asn Leu Ala Val Ser Lys 
565 570 575 

Leu Glu Thr Leu Glu Thr Trp Met Met Pro Lys Lys Val Phe Lys Val 

580 585 590 

Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser Glu Val Gly 
595 600 605 

Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly Leu 
610 615 620 

Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp Gly Lys Gly 
625' 630 635 640 

Pro Asn Val Thr Val Thr Val lie Trp Val Asp Pro Val Asn Val He 
645 650 655 

Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu Phe Thr His 
660 665 670 

Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val T!rp Thr Val 
675 680 685 

Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys Phe Leu Val 
690 695 700 

Ala Pro Leu Thr Phe Ser Asn Lys Gin Pro He Lys Pro Glu Glu Ala 
705 710 715 720 

Leu Lys Leu His Asn Gly Pro Pro Arg Ser Ala Tyr Met -Glu Gin Ser 
725 730 735 

Phe Gin Ser Leu Asn Pro Val Leu Ser Leu His He Asn Pro Ala Gin 
740 745 750 

Val Glu Gin Ala Arg Lys Asn Ala Ala Phe Thr Gly Thr Ala Leu Glu 
755 760 765 

Ala Trp Leu Val Gly Gly Thr Trp Thr Ala Met Asp Val Cys Ala Thr 
770 775 780 

Gly Pro Thr Ala Cys Pro Val Met Gin Thr Cys Ser Gin Thr Ala Trp 
785 790 795 • ' 800 

Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu Leu Gly Ala Val Lys Pro 
805 810 815 

Asp Gly Arg Leu Arg 
820 



<210> 7 
<211> 2858 
<212> DNA 

<213> Rat coronavirus 

<220> 

<221> CDS 

<222> (56) . . (2650) 
<223> rat XT-2 
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<400> 7 

gcgctcgcgg ccgggttgca gggccggacg cgcgccccgc gtcccgggca ggaag atg 58 

Met 
1 

gtg gcg age gcg egg gtg cag aag ctg gtg egg cgc tac aag ctg gcg 106 
Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu Ala 
5 10 15 

ate gee ace gcg ctg gcc ate ctg ctg ctg cag ggc ctg gtg gtg tgg 154 
lie Ala Thr Ala Leu Ala lie Leu Leu Leu Gin Gly Leii Val Val Trp 
20 25 30 

age ttc age ggc ctg gag gag gac gag ccg ggc gag aaa gga agg cag 202 
Ser Phe Ser Gly Leu Glu Glu Asp Glu Pro Gly Glu Lys Gly Arg Gin 
35 40 45 

aga aag cca egg ccg eta gac cet gga gag ggt tec aag gac acg gac 250 
Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr Asp 
50 ' 55 60 65 

age tet gca ggg cgt agg" ggc agt get ggg aga agg cac gga egg tgg 298 
Ser Ser Ala Gly Arg Arg Gly Ser Ala Gly Arg Arg His Gly Arg Trp 
70 75 80 

egg ggc cgc gcc gag age cca ggt gtg cet gtg gcc aag gtg gtg egg 346 
Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val Arg 
85 90 95 

gca gta ace agt egg cag aga gcc age ega egg gtg ccc cet gcc eca 394 
Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala Pro 
100 105 110 

cet cca gag gca cca ggc cgc cag aac ctg agt gga gca gca gcc ggg 442 
Pro Pro Glu Ala Pro Gly Arg Gin Asn Leu Ser Gly Ala Ala Ala Gly 
115 120 125 

gaa gca ctg at a ggt gcc gee ggc tte cea caa cat gga gac aca ggg 490 
Glu Ala Leu lie Gly Ala Ala Gly Phe Pro Gin His Gly Asp Thr Gly 
130 135 140 145 

agt gtg gag ggt gcc cet cag ccc acg gat aac age ttc act ccg aag 538 
Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Ser Phe Thr Pro Lys 
150 , 155 160 

tgt gag att gtg ggc aag gat gca ctg tea gca ctg gee egg gcc age 586 
Cys Glu lie Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala Ser 
165 170 175 

ace aag cat tgt caa caa gag att get aat gta gtg tgc ctg cac caa 634 
Thr Lys His Cys Gin Gin Glu lie Ala Asn Val Val Cys Leu His Gin 
180 185 190 

get ggg aac eta atg ccc aag tct gtg ccc egg cac tgc cag ctg get 682 
Ala Gly Asn Leu Met Pro Lys Ser Val Pro Arg His Cys Gin Leu Ala 
195 200 205 

ggc aag gtg age cet ggc ate cag tgg gaa gag gtc egg gee cag cag 730 
Gly Lys Val Ser Pro Gly lie Gin Trp Glu Glu Val Arg Ala Gin Gin 
210 215 220 225 

cet gtg agt ggc cet ctg gta cgc ate gcc tac atg ctg gtg gtt cac 778 
Pro Val Ser Gly Pro Leu Val Arg He Ala Tyr Met Leu Val Val His 
230 235 240 
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ggc cgt get gtg cgc cag ctg aag cgt ctt ctg aag gcc gtc tac cac 826 
Gly Arg Ala Val Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr His 
245 250 255 

gag gag cac ttc ttt tat att cat gtg gac aag cgt tec aac tac ctg 874 
Glu Glu His Phe Phe Tyr lie His Val Asp Lys Arg Ser Asn Tyr Leu 

260 265 -r./:. 270 

tac egg gag gtg gta gag ctg gcc cag cac tac gac aat gta egg gta 922 
Tyr Arg Glu Val Val Glu Leu Ala Gin His Tyr Asp Asn Val Arg Val 
275 280 ^ 285 

aca cct tgg cgc atg gtc acc ate tgg ggt ggg get age ctt ctg agg 970 
Thr Pro Trp hrq Met Val Thr lie Trp Gly Gly Ala Ser Leu Leu TVrg 
290 295 300 305 

atg tac ctg egg age atg aag gac ctt ctg gag act cct ggc tgg acc 1018 
Met Tyr Leu Arg Ser Met Lys Asp iJeu Leu Glu -Thr Pro Gly Trp Thr 
310 315 320 

tgg gac ttc ttc ate aac ctg age get act gac tat eca ace agg aeg 1066 
Trp Asp Phe Phe He Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg Thr 
325 330 335 

aat gag gag ctg gta gcg ttc tta tec aag aac egg gac aag aat ttc 1114 
Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp -Lys Asn Phe 
340 345 350 

etc aag tea cac ggg cga gac aat tec agg ttc ate aag aaa caa ggc 1162 
Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe lie Lys Lys Gin Gly 
355 360 365 

ctg gac egg ctt ttc cat gag tgt gat tec cac atg tgg cgc ctg ggt 1210 
Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu Gly 
370 375 380 385 

gaa egg cag ate ccg gca ggc ate gtg gtg gat ggt ggc tet gac tgg 1258 
Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly Gly Ser Asp Trp 
390 395 400 

ttc gtg ctg aca cgc age ttt gtg gaa tat gtg gtg tat aca gag gat 1306 
Phe Val Leu Thr Arg Ser Phe Val Glu Tyr .Val Val Tyr Thr Glu Asp 
405 410 415 

cct ctg gtg gee cag ctt cgc cag ttc tat aca tac aca ttg ctt eca 1354 
Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu Pro 
420 425 430 

gee gag tec ttc ttc cac aca gtg ctg gag aac age eca gee tgt gag 1402 
Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Pro Ala Cys Glu 
435 440 445 

age eta gtg gac aac aac ctg egg gtt ace aac tgg aac egg aag ctg 1450 
Ser Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys Leu 
450 455 460 465 

ggc tgc aag tgc cag tac aag cac ate gtg gac tgg tgt ggc tgc tec 1498 
Gly Cys Lys Cys Gin Tyr Lys His He Val Asp Trp Cys Gly Cys Ser 
470 475 480 

ecc aac gac ttc aag eca cag gac ttc ctg egg ctt cag caa gtc tec 1546 
Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val Ser 
485 490 495 
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aga ccc acc ttc ttt gcc egg aag ttt gag teg act gtg aac cag gaa 1594 
Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin Glu 
500 505 510 

gtc ctg gaa att ttg gac ttc cac ctg tat ggc age tac cca ccc ggc 1642 
Val Leu Glu lie Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro Gly 
515 520 525 . • - 

acc cca gcc etc aag gcc tac tgg gag aac ate tac gac atg gcc gat 1690 
Thr Pro Ala Leu Lys Ala Tyr Trjp Glu Asn lie Tyr Asp Met Ala Asp 
530 535 540 545 

ggc ect agt gga etc age gat gtc eta etc act get tac aca gee ttt 1738 
Gly Pro Ser Gly Leu Ser Asp Val Leu Leu Thr Ala Tyr Thr Ala Phe ' 

550 555 560 

gee cgt ate agt ctg cgt eat get gee act gtt tec cca ctg gee act 1786 
Ala Arg lie Ser Leu Arg His Ala l^la Thr Val -Ser Pro Leu Ala Thr 
565 570 575 

gca gtc tgc agg ttt gag ccc agg ggg ttg ccg tec age gtg cac ctg 1834 
Ala Val Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His Leu 
580 585 590 

tat ttc tat gac gac eat ttc cag ggc tac ctg. gtg acg cag gca gtg 1882 
Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala Val 
595 600 605 

cag ccc tea gcc cag ggg cca gca gag aca ett gag atg tgg ctg atg 1930 
Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu Met 
610 615 620 625 

ccc cag agg ttg ctg aag ccg ttg ggg cac agt gac cag gcc age egg 1978 
Pro Gin Arg Leu Leu Lys Pro Leu Gly His Ser Asp Gin Ala Ser Arg 
630 635 640 

etc cag agt ctg gag gtt ggc act gag tgg gac ccc aaa gaa cgt etc 2026 
Leu Gin Ser Leu Glu Val Gly Thr Glu Trp Asp Pro Lys Glu Arg Leu 
645 650 655 

ttc egg aac ttt ggg ggc ctg ttg gga cca ctg gat gaa cet gtg gee 2074 
Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro Val Ala 
660 665 670 

atg cag cgc tgg gcc egg ggc ccc aac etc aca gcc act gtg gtc tgg 2122 
Met Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val Val Trp 
675 680 685 

att gac ccc acc tat gtt gtg gcc aca tec tat gac ate acg gta gat 2170 
lie Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr Val Asp 
690 695 700 705 

gcg gac act gaa gtc acg cag tae aag ccc cca ctg age ctg cca ctg 2218 
Ala Asp Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Leu Pro Leu 
710 715 720 

egg cca gga gcc tgg act gtt cga ttg ett cag ttc tgg gag ccc ctg 2266 
Arg Pro Gly Ala Trp Thr Val Arg Leu Leu Gin Phe Trp Glu Pro Leu 
725 730 735 

ggt gag ace cgc ttc etc gtg ctg cca ttg ace ttc aac cac aaa eta 2314 
Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn His Lys Leii 
740 745 750 
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cct etc agg aaa gat gat gcc age tgg etg cat gcg gga eca ccc cac 2362 
Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro Pro His 
755 760 765 

aac gaa tac atg gaa cag agt ttc cag gga eta agt ggc ate etg aat 2410 
Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Gly He Leu Asn 
770 775 780 785 

etg cct pag_cca gag gee gtg gag gag get gcc egg egg cae aca gag 2458 
Leu Pro Gin Pro Glu Ala Val Glu Glu. Ala Ala Arg Arg His Thr Glu 
790 795 800 

etc aca ggt cct gca ctt gag gcc tgg aca gat ggg gaa etg age agt 2506 
Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Gly Glu Leu Ser Ser 
805 810 815 

ttc tgg tct gtt gca gga ttg tgt gee ata ggg cct tct tet tgt ccc 2554 
Phe Trp Ser Val Ala Gly Leu Cys Ala He Gly -Pro Ser Ser Cys Pro 
820 825 830 

tec etg gag etc tge aga etg ace age tgg age tet etg tct eet gae 2602 
Ser Leu Glu Leu Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp 
835 840 845 

ccc aag tea gag etg ggg cct gtc aaa get gac gga cga etc agg tag 2650 
Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
850 855 860 865 

caggaceeca gccagcacaa eccggaggag ceggggaatt geaeettaea gaeaatggag 2710 

ggaegtctct cetctggtag ecagggagce ctagaaeagt ggtggggetg tctgtctgtc 2770 

aggctttcet aggaaatgga cgacatcagg ccctgctgag atccatgaag tttccccact 2830 

ggaggaggga gccccactgg cagcacag 2858 



<210> 8 
<211> 864 
<212> PRT 

<213> Rat corona virus 
<400> 8 

Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu 
15 10 15 

Ala He Ala Thr Ala Leu Ala He Leu Leu Leu Gin Gly Leu Val Val 
20 25 30 

Trp Ser Phe Ser Gly Leu Glu Glu Asp Glu Pro Gly Glu Lys Gly Arg 
35 40 45 

Gin Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr 
50 55 60 . 

Asp Ser Ser Ala Gly Arg Arg Gly Ser Ala Gly Arg Arg His Gly Arg 
65 70 75 80 

Trp Arg Gly hxg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val 
85 90 95 

Arg Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala 
100 105 110 

Pro Pro Pro Glu Ala Pro Gly Arg Gin Asn .Leu Ser Gly Ala Ala Ala 
115 120 125 
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Gly Glu Ala Leu lie Gly Ala Ala Gly Phe Pro Gin His Gly Asp Thr 

130 135 140 

Gly Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Ser Phe Thr Pro 
145 150 155 160 

Lys Cys Glu He Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala 
165 170 175 

Ser Thr Lys His Cys Gin Gin Glu He Ala Asn Val Val Cys Leu His 
180 185 190 

Gin Ala Gly Asn Leu Met Pro Lys Ser Val Pro Arg His Cys Gin Leu 
195 200 205 ' 

Ala Gly Lys Val Ser Pro Gly He Gin Trp Glu Glu Val Arg Ala Gin 

210 215 220 

Gin Pro Val Ser Gly Pro Leu Val Arg He Ala Tyr Met Leu Val Val 
225 230 235 240 

His Gly Arg Ala Val Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr 
245 250 255 

His Glu Glu His Phe Phe Tyr He His Val Asp Lys Arg Ser Asn Tyr 
260 265 270 

Leu Tyr Arg Glu Val Val Glu Leu Ala Gin His Tyr Asp Asn Val Arg 
275 280 285 

Val Thr Pro Trp Arg Met Val Thr He Trp Gly Gly Ala Ser Leu Leu 
290 295 300 

Arg Met Tyr Leu Arg Ser Met Lys Asp Leu Leu Glu Thr Pro Gly Trp 
305 310 315 320 

Thr Trp Asp Phe Phe He Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg 
325 330 335 

Thr Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn 
340 345 350 

Phe Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe He Lys Lys Gin 
355 360 365 

Gly Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu 
370 375 380 

Gly Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly Gly Ser Asp 
385 390 395 400 

Trp Phe Val Leu Thr Arg Ser Phe Val Glu Tyr Val Val Tyr Thr Glu 
405 410 415 

Asp Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu 
420 425 430 

Pro Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Pro Ala Cys 
435 440 445 

Glu Ser Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys 
450 455 460 
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Leu Gly Cys Lys Cys Gin Tyr Lys His He Val Asp Trp Cys Gly Cys 
465 470 475 480 

Ser Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val 
485 490 495 

Ser Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin 
500 505 510 

Glu Val Leu Glu He Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro 
515 520 525 

Gly Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn He Tyr Asp Met Ala 
530 535 540 

Asp Gly Pro Ser Gly Leu Ser Asp Val Leu Leu Thr Ala Tyr Thr Ala 
545 550 555 560 

Phe Ala Arg He Ser Leu Arg His Ala Ala Thr Val Ser Pro Leu Ala 
565 570 575 

Thr Ala Val Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His 
580 585 590 

Leu Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala 
595 600 605 

Val Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu 
610 615 620 

Met Pro Gin Arg Leu Leu Lys Pro Leu Gly His Ser Asp Gin Ala Ser 
625 630 635 640 

Arg Leu Gin Ser Leu Glu Val Gly Thr Glu Trp Asp Pro Lys Glu Arg 
645 650 655 

Leu Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro Val 
660 665 670 

Ala Met Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val Val 
675 680 685 

Trp He Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr Val 
690 695 700 

Asp Ala Asp Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Leu Pro 
705 710 715 720 

Leu Arg Pro Gly Ala Trp Thr Val Arg Leu Leu Gin Phe Trp Glu Pro 
725 730 735 

Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn His Lys 
740 745 750 

Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro Pro 
755 760 765 

His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Gly He Leu 
770 775 .780 



Asn Leu Pro Gin Pro Glu Ala Val Glu Glu Ala Ala Arg Arg His Thr 
785 790 795 800 
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Glu Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Gly Glu Leu Ser 
805 810 815 

Ser Phe Trp Ser Val Ala Gly Leu Cys Ala He Gly Pro Ser Ser Cys 
820 825 830 

Pro Ser Leu Glu Leu Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro 
835 840 845 

Asp Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
850 855 860 



<210> 9 

<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 

<400> 9 

Cys Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys Leu Glu Gin Gin Glu 
1 5 10 15 

Lys 



<210> 10 : . ■ 

<211> 11 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: peptide 
<400> 10 

Gin Glu Glu Glu Gly Ser Gly Gly Gly Gin Lys 
15 10 

<210> 11 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 11 

Ser Glu Asn Glu Gly Ser Gly Met Ala Glu Gin Lys 
15 10 

<210> 12 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 12 

Thr Glu Asn Glu Gly Ser Gly Leu Thr Asn He Lys 
15 10 
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<210> 13 

<211> 28 

<212> DNA - . 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gtgggtatgc agaagtgggg gaagggac 

<210> 14 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccctccgcaa tgcctaca 



<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

actgaggtca cgcaatacaa 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

gccgcactca ggttgatgaa gaagt 

<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 17 

accaccagga caaaggcgat tctga 



<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 18 

agtcgaacag tccagggccc 

<210> 19 

•'<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<4p0> 19 . . . 
cacgatctcg cacttggggg 



<210> 20 
<211> 32 

<212> DNA ' 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 20 

ctattcgatg atgaagatac cccaccaaac cc 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 21. 

ggaagagctg ggtgtggaat 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 22 

•tttcccgttg agatcctgct 



<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 23 

acagacagca acaacgagaa 
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<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 24 

aaaggaaggc agaggaagc 19 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> • " • 

<223> Description of Artificial Sequence: primer 

<400> 25 

acccctccac tgtctgtaag 20 

<210> 26 
<211> 5 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; digestion 
peptide 

<400> 26 

Glu Leu Gly Ala Lys 

<210> 27 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 27 
Glu Leu Leu Lys 
1 

<210> 28 
<211> 6 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: digestion 
peptide 



<400> 28 

Asp Met Asn Phe Leu Lys 
1 5 
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<210> 29 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 29 

lie Ala Ser Pro Pro Ser Asp Phe ,Gly Arg 
1 ' - 5 10 

<210> 30 

<211> 4 • ■■ 

<212> PRT 

<213> Artificial Sequence 
<220>* 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 30 
Leu Leu Leu Asp 
1 

<210> 31 

<211> 13 . • 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 31 

Asp Phe Giu Asn Val Asp Asn Ser Asn Phe Ala Pro Arg 
15 10 

<210> 32 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 32 

Pro Thr Phe Phe Ala Arg 
1 5 

<210> 33 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 



<400> 33 

Leu Gin Phe Ser Glu Val Gly Thr Asp Xaa Asp Ala 
15 10 
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<210> 34 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 34 

Glu Leu Gly Ala Val Lys Pro Asp Gly Arg Leu 
■ 1 '5 10 

<210> 35 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 35 

Glu Leu Leu Lys Arg Lys Leu Glu Gin Gin Glu Lys 
1 .5 10 

<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 



<400> 36 

Leu Gly Leu Leu Met Pro Glu Lys 
1 5 
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□ 



3. Claims Nos.: 
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see additional sheet 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
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